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Description 



Technical Field 



5 [0001] The present invention relates to a method of diagnostic of the diseases caused by mutations in the LKB1 
gene and to an diagnostic reagent and a therapeutic preparation for said diseases. 

Background Art 

10 [0002] PeutZnJeghers (PJ) syndrome [MIM 1 75200] is an autosomal dominarttly inherited disorder characterized by 
melanotic pigmentation of lips, perioral and buccal regions and by benign, hamartomatous and adenomatous types of 
multiple gastrointestinal polyps. Patients with this syndrome are known to frequently develop benign or malignant neo- 
plasms in the gastrointestinal tract, pancreas, ovaries, testis, breast and uterus. In particular, small benign tumors fre- 
quently occur in the ovary and are developed as multifocal, symmetrical germinal-cord structures with annular tubules. 

15 Then, they progress into granulosa cancer and result in ambisexual precocity in girls. It was found that multifocal ger- 
minal cord cancer in boys, though with low frequency, results in gynecomasty and feminization due to over-production 
of estrogen. Defects in the gene responsible for Peutz-Jeghers syndrome (PJ gene) appear to predispose to a wide 
spectrum of neoplastic diseases. In fact, 50% of the carriers with a defect in one of PJ allele are known to develop can- 
cer by the age of 60 (Giardiello, F.M. et al. Increased risk of cancer in Peutz-Jeghers syndrome. N. Engl. J. Med. 

20 316,1511-1514 (1987); Spigelman, A.D., Murday, V. & Phillips, R.K. Cancer and Peutz-Jeghers syndrome. Gut 30, 
1588-1590 (1989). [MIM 175200D- It is believed that loss or inactivation of the PJ gene product results in disruption of 
the fundamental growth control mechanism within somatic cells that have potential high proliferative capacity, which 
triggers the growth of benign hamartomatous polyps some of which turn into malignant tumor cells after further genetic 
alteration. 

25 [0003] Recently, it was reported that PJ gene was mapped to chromosome 1 9p1 3.3 by linkage analysis in 1 2 fam- 
ilies of PJ syndrome with a multipoint lod score of 7.00 at the microsatellite genetic marker D19S886 (Hemminki, A. et 
al. Localization of a susceptibility locus for PJ syndrome to 19p using comparative genomic hybridization and targeted 
linkage analysis. Nat. Genet 15 (1), 87-90 (1997)). A similar linkage between PJ gene and the genetic marker 
D19S886 was reported in a second study investigating five other families as well (with a multipoint lod score of 7.52) 

30 (Amos, C.I. et al. Fine mapping of a genetic locus for PJ syndrome on chromosome 19p. Cancer Res. 57, 3653-3656 
(1997)). D19S565 was first known as a genetic marker proximal to the PJ gene, which causes recombination with the 
gene. Thereafter, the genetic marker D19S878 was found to be located more proximal to the gene. These two makers, 
therefore, were thought to define the proximal border of the PJ candidate region. In both linkage studies, no recombi- 
nation was observed between the marker D19S886 and PJ gene, indicating that they are located in a narrow interval. 

35 

Disclosure of the Invention 

[0004] An objective of the present invention is to identify the gene responsible for Peutz-Jeghers syndrome and to 
provide a method for diagnosing diseases resulting from a mutation in this gene, a diagnostic reagent and a therapeutic 

40 preparation for the diseases. 

[0005] To identrfiy the gene for PJ syndrome, the present inventors made a continuous cosmid corrtig and a restric- 
tion map for the region extending from 1 .5 Mb within chromosome 19p1 3.3 which includes the PJ gene. EST database 
search for the genes mapped in this region was then performed, and the precise locations of these genes were deter- 
mined. The inventors evaluated biological information of a number of genes thus found and selected several potent can- 

45 didates of the PJ gene. Mutation analysis of these candidate genes in DNAs from patients with PJ syndrome revealed 
that one of the candidate genes, "LKB1 ," was specifically mutated in the patients with PJ syndrome. Thus, the inventors' 
intense investigations successfully identified the gene responsible for PJ syndrome for the first time. Based on this find- 
ing, the inventors found that diseases caused by mutations in the LKB1 gene can be diagnosed and treated by utilizing 
the LKB1 gene, primers and probes based on the sequence thereof, LKB1 protein and antibodies that bind to LKB1 pro- 

so tein. 

[0006] As described above, the present invention relates to a method of diagnosing diseases caused by mutation 
in the LKB1 gene, the gene responsible for PJ syndrome, and to a diagnostic reagent and a therapeutic preparation for 
the diseases. More specifically, the present invention relates to: 

55 (1) a primer DNA used for diagnosing a disease caused by mutation in the LKB1 gene, the primer DNA comprising 
a nucleotide sequence containing at least a portion of any one of the nucleotide sequences shown in SEQ ID NOs: 
1 to 4; 

(2) the primer DNA according to (1), wherein the primer DNA has a nucleotide sequence corresponding to any one 



2 



V 



EP 1 036 844 A1 

of the nucleotide sequences shown in SEQ ID NOs: 7 to 30; 

(3) the primer DNA according to (1) or (2), wherein the disease caused by mutation in the LKB1 gene is Peutz- 
Jeghers syndrome; 

(4) a probe DNA used for diagnosing a disease caused by mutation in the LKB1 gene, the probe DNA comprising 
s a nucleotide sequence containing at least a portion of any one of the nucleotide sequences shown in SEQ ID NOs: 

1 to 4; 

(5) the probe DNA according to (4), wherein the disease caused by mutation in the LKB1 gene is Peutz-Jeghers 
syndrome; 

(6) a therapeutic preparation for a disease caused by mutation in the LKB1 gene, the preparation comprising the 
10 LKB1 gene as an active ingredient; 

(7) a therapeutic preparation for a disease caused by mutation in the LKB1 gene, the preparation comprising the 
LKB1 protein an active ingredient; 

(8) a therapeutic preparation for a disease caused by mutation in the LKB1 gene, the preparation comprising a 
compound that enhances the activity of LKB1 protein as an active ingredient; 

is (9) the therapeutic preparation according to (6) to (8), wherein the disease caused by mutation in the LKB1 gene 
is Peutz-Jeghers syndrome; 

(10) a reagent for diagnostic of a disease caused by mutation in the LKB1 gene, the reagent comprising an anti- 
body that binds to the LKB1 protein as an active ingredient; 

(1 1) the reagent according to (10), wherein the disease caused by mutation in the LKB1 gene is Peutz-Jeghers syn- 
20 drome; 

(12) a method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising detecting 
mutation in the LKB1 gene; 

(13) a method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

25 (a) preparing a DNA sample from a patient; 

(b) amplifying the DNA using the primer DNA according to (1) ; 

(c) cleaving the amplified DNA; 

(d) fractionating the DNA fragments according to their size; 

(e) hybridizing the probe DNA according to (4) with the fractionated DNA fragments; and 

30 (f) comparing the size of the DNA fragment thus detected to that from a control of a healthy subject; 

(14) a method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

(a) preparing a RNA sample from a patient; 
35 (b) fractionating the RNA sample depending on its size; 

(c) hybridizing the probe DNA according to (4) with the RNA thus fractionated; 

(d) comparing the size of the RNA thus detected to that from a control of a healthy subject; 

(15) a method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

40 

(a) preparing a DNA sample from a patient; 

(b) amplifying the DNA using the primer DNA according to (1) ; 
(C) separating the amplified DNA into single stranded DNA; 

(d) fractionating the separated single stranded DNA on a non-denatured gel; 
45 (e) comparing the mobility of the single stranded DNA separated on the non-denatured gel to that of a control 

of a healthy subject; 

(16) a method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

so (a) preparing a DNA sample from a patient; 

(b) amplifying the DNA using the primer DNA according to (1); 

(c) fractionating the amplified DNA on the DNA denatured gradient gel; 

(d) comparing the mobility of the fractionated DNA on the gel to that of a control of a healthy subject; 

55 (17) the method according to any one of (12) to (16), wherein the disease caused by mutation in the LKB1 gene is 
Peutz-Jeghers syndrome. 

[0007] The present invention was made based on the inventors' findings that Peutz-Jeghers syndrome is caused 
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by a mutation in the gene called "LKB1 The present invention primarily relates to the use of polynucleotides containing 
at least a portion of the nucleotide sequence corresponding to the genomic DNA coding LKB1 (including intron, pro- 
moter, and enhancer regions as well as exon regions), and said polynucleotides for diagnosing diseases resulting from 
mutations in the LKB1 gene. The genomic DNA regions of LKB1 are shown in the SEQ ID: 1 to 4. The sequences 

5 shown in SEQ ID: 1 to 4 correspond to 5' upstream region, exon 1 and intron 1 (a part) region, exons 2 to 8 and introns 
1 (a part) to 8 (a part) region, and intron 8 (a part) and exon 9 region of the LKB1 gene, respectively. 
[0008] Nucleotide sequences containing a portion of these regions can be used as primers or probes to diagnose 
the diseases resulting from mutations in the LKB1 gene. A nucleotide used as a primer is typically 15 to 100 bp. prefer- 
ably 1 7 to 30 bp. Any primer can be used as long as it amplifies at least a portion of the LKB1 gene or regions regulating 

w the gene expression. Such regions include, for example, exon, intron, promoter, and enhancer regions of the LKB1 
gene. On the other hand, nucleotides used as a probe typically have a sequence of at least 15 bp or more when the 
nucleotides are synthetic oligonucleotides. Double stranded DNA obtained from a done into which a vector such as 
plasmid DNA is incorporated can be used as a probe. For a region utilized as a probe, any part of the LKB1 gene or 
region regulating the expression thereof can be used. Such regions include, for example, exon, intron, promoter and 

is enhancer regions of the LKB1 gene. When used as a probe, oligonucleotide or double stranded DNA is used after ade- 
quately labeling. Labeling methods, for example, include labeling of the 5 -terminus by phosphorylating with 32 P using 
T4 polynucleotide kinase, and labeling by incorporation of substrate bases labeled with isotopes such as 32 P, fluoro- 
chrome or biotin using DNA polymerases such as Klenow enzyme and primers such as random hexamer oligonucle- 
tides (random-primer method). 

20 [0009] The diseases detectable using these nucleotides, which are caused by mutations in the LKB1 gene are not 
limited to Peutz-Jeghers syndrome. Any disease caused by mutations in the LKB1 gene is included. PTEN and APC 
genes were discovered as causative genes for Cowden's disease, which is one of hereditary cancers, and Familial Ade- 
nomatous polyposis (FAP), respectively. Both genes proved to be mutated with high frequency in the non-hereditary 
common cancers. PTEN and APC genes function to control cell proliferation in the normal tissues, and it is believed that 

25 a critical step in tumorigenesis occurs when these genes mutate and lose their functions resulting in cells escaping from 
the regulation of these genes. Similar to PTEN and APC genes, a mutation in the LKB1 gene possibly plays a part in 
common tumorigenesis. 

[0010] The diagnostic method of the diseases caused by mutations in the LKB1 gene in the present invention fea- 
tures detection of mutations in the LKB1 gene. In the present invention, by the term "the diagnosis of the diseases 

30 caused by mutations in the LKB1 gene" is meant not only the testing of patients who have developed the particular 
symptoms resulting from a mutation in the LKB1 gene, but also testing a mutation in the LKB1 gene for determining 
whether the subject has a predisposition to the particular disease resulting from a mutation in the LKB1 gene. A muta- 
tion in one of allele of the LKB1 gene is thought to largely increase the risk for a particular disease caused by mutations 
in the LKB1 gene even if the symptom has not apparently been developed. The present invention also includes a diag- 

35 nostic method for identifying patients who have a mutation in one of allele of the LKB1 gene (carriers). "Detection of a 
mutation in the LKB1 gene" in the present invention includes detection in DNA, RNA and protein. 
[0011] One embodiment of the diagnostic method of the present invention is a method of directly determining the 
nucleotide sequence of the LKB1 gene of a patient. For example, sequencing is performed after amplifying the whole 
or a partial sequence of the LKB1 gene of a patient using a technique such as PCR (Polymerase Chain Reaction), using 

40 the nucleotides described above as primers, and the DNA isolated from a patient who is suspected of being afflicted 
with a disease caused by a mutation in the LKB1 gene, as template. The sequence thus determined can be used to 
diagnose the diseases caused by mutations in the LKB1 gene by comparing it to the sequence of the LKB1 gene from 
healthy subjects. 

[0012] In addition to the methods as described above in which DNA from a patient is directly sequenced, different 
45 methods can be used as diagnostic methods of the present invention. One of such embodiments comprises the steps 
of (a) preparing DNA samples from patients, (b) amplifying the DNA from the patients using the primer DNA of the 
present invention, (c) separating the amplified DNA into the single stranded DNA, (d) fractionating the single stranded 
DNA separated on non-denaturing gel, and (e) comparing the mobility of the single stranded DNA separated on the gel 
to that of a control of ordinary person. 
so [0013] Such methods include PCR-SSCP (single-strand conformation polymorphism; polymorphism of single- 
stranded DNA in higher-order structure) method (Cloning and polymerase chain reaction-single-strand conformation 
polymorphism analysis of anonymous Alu repeats on chromosome 1 1 . Genomics. 1992 Jan 1 ; 12(1): 139-146., Detec- 
tion of p53 gene mutations in human brain tumors by single-strand conformation polymorphism analysis of polymerase 
chain reaction products. Oncogene. 1991 Aug 1; 6(8): 1313-1318., Multiple fluorescence-based PCR-SSCP analysis 
55 with postlabeling., PCR Methods Appl. 1995 Apr 1 ;4(5):275-282.). Having the advantage of relatively simple manipula- 
tion and smaller sample volume requirement, this method is particularly suitable for screening a large number of DNA 
samples. The principles of the method are as follows: when double stranded DNA fragments are denatured into single 
stranded DNA, each single stranded DNA forms an original higher-order structure peculiar to its nucleotide sequence; 
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these denatured DNA, even if they are complementary to each other and have the same chain-length, migrate to dis- 
tinct positions according to their conformation when electrophoresed on a non-denaturing polyacrylamide gel; the con- 
formations of these single stranded DNA are altered by substitution of a single nucleotide and the substituted DNA 
migrate with different mobility by electrophoresis on polyacrylamide gel; and thus, detection of the alteration in the 
5 mobility enables the detection of mutations such as point mutation, deletion or insertion in the DNA fragment of the 
interest. 

[0014] In the PCR-SSCP method, the whole or a portion of the LKB1 gene is initially amplified using a technique 
such as PCR. A nucleotide sequence ranging in length from about 200 to 400 bp is typically preferred as a fragment to 
be amplified. Portions to be amplified include an exon, intron, promoter and enhancer of the LKB1 gene. PCR can be 

10 performed under the conventional conditions (for example, the conditions used for amplification of each exons using the 
primers shown in example 5). On amplifying the gene fragment by PCR, DNA fragments to be synthesized in the PCR 
reaction may be labeled with primers labeled with an isotope such as 32 P, f luorochrome or biotin, or by adding substrate 
nucleotides labeled with an isotope such as 32 P, f luorochrome or biotin in the PCR reaction solution. The said DNA frag- 
ments may also be labeled by adding substrate nucleotides labeled with an isotope such as 32 P, f luorochrome or biotin 

15 to the synthesized DNA fragments by using Klenow enzyme after the PCR reaction. The labeled DNA fragments thus 
obtained are denatured by heat, for example, aid electrophoresed on a polyacrylamide gel without any denaturing 
agent, such as urea. As to the electrophoresis, conditions for fractionating DNA fragments can be improved by adding 
an appropriate amount of glycerol (5 to 10%) to polyacrylamide. Conditions for electrophoresis vary depending on the 
nature of each DNA fragment. Electrophoresis is typically performed at room temperature (20 to 25°C), but when pref- 

20 erable fractionation is not obtained, in the temperature ranging from 4 to 30°C, it is better to test the optimal mobility to 
determine the temperature which gives the most desirable mobility. After the electrophoresis, mobility of the DNA frag- 
ment is detected and analyzed by autoradiography using X-ray films and by scanner for detection of fluorescence. 
When the bands with different mobility are detected, those bands are directly dissected from the gel, re-amplified by 
PCR, and subjected to direct sequencing to confirm the mutation. In case where DNA fragments synthesized by PCR 

25 are not labeled, the bands of said DNA fragments may be detected using staining techniques such as ethidium bromide 
or silver staining. 

[001 5] Another embodiment of the diagnostic methods of the present invention comprises the steps of (a) preparing 
a DNA sample from a patient, (b) amplifying the DNA derived from the patient using the DNA primers of the present 
invention, (c) cleaving the DNA thus amplified, (d) fractionating the DNA fragments depending on their size, (e) hybrid - 
30 izing the DNA fragments thus fractionated with the probe DNA of the present invention, and (f) comparing the length of 
the DNA fragment thus detected to that from healthy subjects. 

[0016] These methods include the methods utilizing restriction fragment length polymorphism (RFLP) and PCR- 
RFLP method. These methods are based on the principle that when a mutation has occurred in the recognition site for 
a restriction enzyme or when insertion or deletion of bases has occurred in the DNA fragments generated by treatment 

35 with a restriction enzyme, the length of those fragments treated with the restriction enzyme generally deviates from that 
of normal subjects. In fact, for Peutz-Jeghers syndrome, in the LKB1 gene of the four patients shown as samples, D, B, 
MA and FA, with Peutz-Jeghers syndrome, acquisition of Seal site, elimination of Ahdl, Rsal, and BsrBI site had 
occurred, respectively (Table 3). Therefore, these mutations can be detected as the differences of bands' mobility after 
electrophoresis, which was performed after the portions containing these mutations are amplified by PCR and then 

40 treated with the restriction enzymes mentioned above. Alternatively, the mutations can be detected by subjecting DNA 
from patients to southern blotting using the probe DNA of the present invention, after the DNA is treated with these 
restriction enzymes and electrophoresed. Restriction enzymes used other than those mentioned above are properly 
selected depending on each mutation. In this method, besides detecting the mutations via restriction enzyme treatment 
of genomic DNA prepared from patients, cDNA prepared by the treatment of RNA prepared from patients with reverse 

45 transcriptase is directly treated with the restriction enzyme and then can be used for southern blotting to detect muta- 
tions. This cDNA can also be used as a template for PCR, and the amplified product thereof (the whole or a portion of 
the LKB1 gene) can be digested with a restriction enzyme and then electrophoresed to detect mutations shown as dif- 
ference of mobility of the DNA fragments. 

[0017] RNA prepared from patients may be used for the detection instead of DNA. Such a method comprises the 
50 steps of (a) preparing an RNA sample from a patient, (b) fractionating the prepared RNA depending on the size, (c) 
hybridizing the RNA thus fractionated to the DNA probe of the present invention, and (d) comparing the size of the RNA 
fragment thus detected to that from normal subjects. Specifically. RNA prepared from a patient is electrophoresed and 
subjected to northern blotting to detect the difference of mobility. 

[0018] Other embodiments of the diagnostic methods of the present invention comprise the steps of (a) preparing 
55 a DNA sample from a patient, (b) amplifying the DNA derived from the patient using the primers of the present inven- 
tion, (c) fractionating the amplified DNA by electrophoresis on the denaturing gradient gel, (d) comparing the mobility of 
the DNA on the gel fractionated to that from normal subjects. 

[0019] A similar method includes denaturant gradient gel electrophoresis (DQGE) method. In this method, the 
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whole or a portion of the LKB1 gene is amplified by PCR using primers of the present invention, for example, and elec- 
trophoresed on the polyacrylamide gel in which the concentration of a denaturant such as urea gradually increases as 
the DNA migrate through the gel. The mobility of the DNA is compared to that of normal subjects. If a DNA fragment 
contains a mutation, it comes to be melted into single stranded DNA in the point of lower denaturant concentration and 

5 is remarkably retarded in mobility, and detection of such difference of mobility allows detection of a mutation. 

[0020] Besides these methods, allele specific oligonucleotide (ASO) hybridization method can be used for the pur- 
pose of only detecting only a mutation in a specific position. In this method, an oligonucleotide containing sequence 
supposed to have a mutation is synthesized and hybridized with a DNA sample. If the DNA sample has a mutation, the 
efficiency of hybridization decreases, and this decrease is detected using techniques such as southern blotting and the 

10 method of utilizing the florescence quenching property of specific f lorescent reagents, which are quenched when inter- 
calated into the gap between the hybrids. 

[0021] Ribonuclease A mismatch cleavage may also be used for the detection. In this method, the whole or a por- 
tion of the LKB1 gene is amplified by PCR, for example, and the amplified DNA is subjected to hybridization with labeled 
RNA prepared from LKB1 cDNA or and such, which is inserted into a plasmid vector and such. This hybrid forms single 
15 stranded structures at the sites of mutation, and said sites may be cleaved by ribonuclease A and then detected using 
some method such as autoradiography. Presence of mutations, can be thus detected. 

[0022] The present invention relates to a diagnostic reagent for diseases caused by mutations in the LKB1 gene, 
wherein the reagent comprises, as an active ingredient, an antibody, which binds to the LKB1 protein. Antibodies, which 
bind to LKB1 protein, can be prepared according to methods well known in the art. As for polyclonal antibodies, for 

20 example, a small animal such as a rabbit is immunized with the LKB1 protein (a natural protein as well as a recombinant 
KLB1 protein expressed in suitable host cells (Sco//, yeast or mammalian cells), such as LKB1 protein expressed as a 
fusion protein with GST in E.coli) or its partial peptide(for example, peptide composed of amino acid sequence shown 
in the SEQ ID: 31 or 34) to obtain antiserum, which then can be purified and prepared using a method such as ammo- 
nium sulfate precipitation, protein A, protein G column, DEAE ion exchange chromatography, affinity columns coupled 

25 with LKB1 protein or synthetic peptide. For monoclonal antibodies, for example, a small animal such as a mouse is 
immunized with LKB1 protein or a part of the peptide, and then dissected to remove the spleen. The spleen is homog- 
enized to isolate the cell fraction. These cells are fused with mouse myeloma cells and a reagent such as polyethylene 
glycol to create fusion cells (hybridomas). From the hybridoma cells thus generated, an appropriate clone which pro- 
duces an antibody which binds to LKB1 protein is selected. Subsequently, the hybridoma cells thus obtained are intra- 

30 perrtoneally transferred into a mouse, ascites is harvested from the same mouse, and the monoclonal antibody thus 
obtained can be prepared by purifying by means of, for example, ammonium sulfate precipitation, protein A, protein G 
column, DEAE ion exchange chromatography, affinity columns coupled with LKB1 protein or synthetic peptide. 
[0023] When used as a reagent, the antibody is, if necessary, mixed with sterile water, physiological saline, vege- 
table oils, surface active agent, lipids, solubility increasing agent, stabilizers (e.g. BSA and gelatin), and preservatives 

35 and such. In a test using said antibody, a tissue or cells from a patient are stained using a method such as enzyme- 
labeled or fluorescence-labeled antibody technique to detect deficiency, aberrant accumulation, or unusual intracellular 
distribution of LKB1 protein. Alternatively, the protein, which is fractionated from cell extracts prepared from a tissue or 
cells from patients with Peutz-Jeghers syndrome using a method such as SDS-PAGE, is transferred onto a membrane 
such as nitrocellulose or PVDF, and detected using a staining method such as enzyme-labeled technique above (west- 

40 ern blotting, immunoblotting). 

[0024] By constructing a detailed physical map of 19p13.3 region, the inventors revealed that the disease-related 
gene LKB1 for Peutz-Jeghers syndrome is located in the close proximity of a microsatellite marker D19S886, i.e. their 
distance on chromosome is about 190 kb. Therefore, the loss of heterozygosity (LOH) test utilizing D19S886 marker 
may effectively serve as a test for diagnosing various diseases based on mutations in the LKB1 gene. 

45 [0025] The present invention also relates to a therapeutic preparation for diseases caused by mutations in the LKB1 
gene. In one embodiment, it comprises the LKB1 gene as an active ingredient. When the LKB1 gene is used as a ther- 
apeutic preparation, the whole or a portion of the genomic LKB1 DNA, or the LKB1 cDNA (SEQ ID: 5) is incorporated 
into an appropriate vector, such as adenovirus vector, adenoassociated virus vector, retrovirus vector, and plasmid 
DNA, and administered orally, intravenously, or topically to the patient. As a method of administration, ex vivo adminis- 

so tration can be used as well as in vivo administration. In administration of a drug, enclosing the gene into a liposome 
generated by micellization of phoshpholipids can enhance the mobility and intake of the gene into the tissue. Alterna- 
tively, cationic lipids may be added to form a complex with DNA, which can enhance the mobility and intake of the gene 
into the tissue. Using these methods, the LKB1 gene mutated in the patient can be substituted by a normal gene, or the 
normal gene can be additionally administered to the patient, resulting in the possible treatment of a disease caused by 

55 a mutation in LKB1 gene. 

[0026] Another embodiment of a therapeutic preparation for the diseases caused by mutations in the LKB1 gene, 
comprises the LKB1 protein as an active ingredient. The LKB1 protein may be prepared as a natural protein or a recom- 
binant protein by utilizing a recombinant DNA technology The amino acid sequence of the LKB1 protein is shown in 
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SEQ ID: 6. A natural protein may be isolated using well-known methods. For example, it can be isolated from the cul- 
tured cells of the testis, fetal liver or K562 cell, in which the LKB1 protein is expressed at a high level, and by affinity 
column chromatography using an antibody against a partial peptide of the LKB1 protein described in Example 7. On 
the other hand, a recombinant protein can be prepared by culturing cells transformed by DNA (SEQ ID: 5) encoding 
5 LKB1 protein. Cells that can be used to produce a recombinant protein include mammalian cells, such as COS, CHO, 
and NIH3T3 cells; insect cells, such as Sf9 cells; yeast ; and E. coli. The vectors suitable for expressing a recombinant 
protein intraceltularly depend on the host cells, for example, pcDNA3 (Invitrogen) or pEF-BOS (Nucleic Acids. Res. 
1990, 18(17), p.5322) vector is used for mammalian cells; "BAC-to-BAC baculovirus expression system" (GIBCO BRL) 
for insect cells; "Pichia Expression Kit" (Invitrogen) for yeast; and pGEX-5X-1 (Pharmacia) and "QIAexpress system" 
10 (Quiagen) for E. coli. The vectors can be introduced into the host cells by a well-known method such as the calcium 
phosphate method, DEAE dextran method, the method using cationic liposome DOTAP (Boehringer Mannheim) or 
SuperFect (Quiagen), electroporation, and calcium chloride. A recombinant protein thus obtained can be purified using 
a conventional technique, for example, the method described in "The Qiaexpressionist handbook, Quiagen, Hilden, 
Germany". 

15 [0027] When the LKB1 protein obtained is used as a therapeutic preparation for diseases caused by mutations in 
the LKB1 gene, the LKB1 protein can be directly administered, or can be given after formulating by a well-known phar- 
maceutical process. For example, formulations may be administered in proper combination with a pharmaceutically 
acceptable carrier or medium, such as sterile water, physiological saline, vegetable oils, surfactants, lipids, dissolving 
adjuvants, stabilizers, or preservatives. While the dosage for administration differs depending on various factors, such 

20 as weight, age, and health conditions, or the method of administration, a person skilled in the art will be able to advan- 
tageously select the appropriate dosage. Typically, the dosage is in the range of 0.01 to 1 000 mg/kg. Administration can 
be conducted, for example, orally, intravenously, intramuscularly or subcutaneously. 

[0028] A person skilled in the art can easily carry out substitution, deletion, addition and/or insertion of amino acids 
in the amino acid sequence of the LKB1 protein for the purpose of improving the activity and stability of the drug of the 
25 present invention, utilizing a well-known method, such as PCR-based site-directed mutagenesis (GIBCO-BRL, Garth - 
ersburg, Maryland), site-directed mutagenesis using oligonucleotides (Kramer, W. and Fritz, HJ (1987) Methods in 
Enzymol., 154:350-367), the Kunkel's method (Methods Enzymol. 85, 2763-2766 (1988)). A similar altered LKB1 pro- 
tein can also be used as a therapeutic preparation of the present invention. 

[0029] Another embodiment of a therapeutic preparation for diseases caused by mutations in the LKB1 gene, the 
30 therapeutic preparation comprises a compound, which enhances the activity of the LKB1 protein, as an active ingredi- 
ent. The LKB1 gene encodes a serine threonine kinase that shows as high as 82% homology with XEEK1 serine thre- 
onine kinase from Xenopus. A collapse of the serine threonine kinase activity of the LKB1 protein is closely associated 
with the emergence of diseases resulting from mutations in the LKB1 gene. Therefore, it is envisaged that enhance- 
ment of the said serine threonine kinase activity will serve to treat diseases caused by mutations in the LKB1 gene. 
35 [0030] A screening method for a compound that enhances the activity of the LKB1 protein is as follows. For exam- 
ple, LKB1 proteins expressed in E.coli as a fusion protein with GST, or those expressed in mammalian or insect cells, 
are used to determine kinase activities of these proteins in the presence of the test compound, to select a compound 
that enhances the activity of the LKB1 protein. 

[0031] More specifically, for example, phosphorylation activity of a substrate protein for the LKB1 protein, or auto- 
40 phosphorylation activity of the LKB1 protein may be determined by measuring the transfer of 32 P from [y- 32 P]ATP to the 
substrate in an appropriate reaction solution (e.g. 50 mM Tris-HCI, pH 7.2, 1 mM dithiothreitol (DTT), 10 mM MgCI2, 10 
mM MnCI 2 , and so on), using a device, such as a liquid scintillation counter, and thus a compound which enhances the 
activity of LIB1 protein can be isolated by selecting a compound that increass the 32 P transfer level. Like the LKB1 pro- 
tein used as therapeutic preparation as described above, the isolated compound may be formulated using well-known 
45 pharmaceutical processes to be administered for treatment of a disease. Typically, the dosage is in the range of 0.01 to 
1000 mg/kg. 

[0032] In addition to the methods as described above, a method utilizing the regions regulating the LKB1 gene 
expression, or a factor binding to the gene can be used for treatment of the diseases caused by mutations in the LKB1 
gene. The present invention has revealed the structure of the LKB1 gene and the 5' upstream region thereof (SEQ ID: 

so 1). This 5* upstream region may contain the regions regulating the LKB1 gene expression (e.g. promoters and enchan- 
cers), and a person skilled in the art could easily specify genetic regions regulating the LKB1 gene expression, using 
several known methods in combination. A method for specifying the region regulating gene, for example, comprises the 
following steps: (a) constructing a vector in which a reporter gene is joined to downstream of the 5 f upstream region of 
the LKB1 gene (DNA composed of the whole or a portion of sequence shown in SEQ ID:1); (b) introducing said vector 

55 into appropriate cells; and (c) detecting the activity of the reporter gene. Specifically, the upstream region of the LKB1 
gene is cleaved into appropriate sized fragments, for example, by various restriction enzymes, and these fragments are 
integrated into the upstream site of a reporter gene, such as a firefly luciferase. secretory alkali phosphatase, or chlo- 
ramphenicol acetyltransferase (CAT) gene to construct expression vectors (PicaGeneTM Vector, Wako Pure Chemicals 
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Industries. Ltd). Subsequently, these expression vectors are introduced into appropriate host cells, such as COS, 
HEK293, and CHO cells, and then incubated for a certain interval. After the incubation, the intracellular and extracellu- 
lar reporter gene product expression is separately measured to determine the promoter activity of the individual gene 
fragment integrated into the vector. Once a gene fragment showing the promoter activity is identified, such a fragment 

5 may further be cleaved into smaller fragments and subjected again to the same process as described above to define 
the active site to a more specified region. To finally confirm the active site, the nucleotide sequence of the region spec- 
ified as the active site may be altered by, for example, site-directed mutagenesis and the activity is measured. The 
region regulating the LKB1 gene expression is particularly useful in the gene therapy described above, since it can 
direct the expression of the LKB1 gene in vivo, under natural expression control, when this region is joined to the 

w upstream of the normal LKB1 gene described above and then administered to a patient whose LKB1 gene is mutated. 
[0033] In addition, once the promoter region is defined in the upstream of the LKB1 gene, screening for compounds 
that can regulate the LRB1 gene expression may be easily facilitated by investigating effects on the production of the 
reporter gene product using reporter gene expression vectors with this site and various compounds. Such screening 
method comprises the steps of: (a) constructing a vector in which a reporter gene is joined to downstream of the pro- 

15 moter region of the LKB1 gene; (b) introducing said vector into appropriate cells; and (c) detecting the activity of the 
reporter gene by exposing a test compound to the said cells and/or introducing the compound into the said cells. Test 
compounds include, but are not limited to, proteins, peptides, synthetic compounds, natural compounds, genes, and 
gene products and such. 

[0034] Screening for compounds that can regulate the LKB1 gene expression may be carried out by exposing a test 
20 sample to the promoter region and selecting the compound (ag. a protein) that binds to said promoter region. For 
instance, transcriptional regulatory factors which control the LKB1 gene expression and bind to this promoter can be 
purified using affinity-purification by, creating a synthetic oligo-DNA and such containing the promoter sequence, bind- 
ing this to a suitable supporting-agent, such as cellulose, and exposing it to a cell-extract and such. 
[0035] Additionally, the inventors have revealed that neoplasia, such as polyps, develop as a result of a mutation in 
25 the LKB1 gene in the patients with Peutz-Jeghers syndrome. This finding lead to a notion decreasing in the amount or 
activity of the LKB1 protein can render normal cells temporary cell proliferation activity. Therefore, the artificial reduction 
in the amount or activity of the LKB1 protein, which is achieved by utilizing anti-sense DNA for the LKB1 gene or the 
cDNA thereof, or by utilizing a compound which inhibits the activity of the LKB1 gene, will possibly serve to treat the 
diseases which require fresh cell proliferation, such as wound curing and anagenesis. 

30 

Brief Description of the Drawings 
[0036] 

35 Figure 1a illustrates the chromosomal organization of the LKB1 gene and the adjacent regions together with the 
restriction enzyme map, wherein R indicates EcoRI site; B, BamHV, S, Sad; K t Kpn\\ BssH, SssHII. Nine exons of 
the LKB1 gene were indicated by boxes. Closed and open boxes represent the translated and untranslated regions, 
respectively. Figure lb shows a schematic illustration of gene rearrangement found in Peutz-Jeghers patient A. The 
upper part of the panel shows the organization of the normal-type LKB1 gene; the bottom part shows the organi- 

40 zation of the mutant-type gene from Peutz-Jeghers patient A. The orientations and the positions of the primers 
used in PCR analysis are shown by arrows. 

Figure 2 shows a corrtig of cosmid clones in the region neighboring to the position of microsatellite marker 
D19S886, and the map of EcoRI site in the region. The bottom shows a schematic illustration of positional relation- 
ship between the LKB1 gene and microsatellite marker D19S886 in the region of human chromosome 19p13.3; tel 
45 denotes the direction to the telomere; cen, the direction to the centromere. The upper part of the panel shows the 
partially overlapping, arranged cosmid clones. 

Figure 3a displays the pedigree of patient A expanding three generations. The symbol in which the right half is filled 
indicates an affected member. Figure 3b shows the result obtained by long distance PCR with primers, DJ666 and 
DJ660. An aberrant type of PCR products of 2.5kbp, which is indicated by a filled triangle, is observed only in the 
so affected family members. Figure 3c shows the result obtained by PCR analysis with primers. DJ666 and DJ684. 
The amplification of the PCR products is observed only in the affected members, which indicates that an inversion 
exists in the corresponding region. 

Figure 4 shows the pattern of agarose gel electrophoresis of LKB1 exons amplified by PCR. Exon Numbers are 
indicated on the top. MW represents the molecular weight marker. 
55 Figure 5 shows an electrophoretic pattern obtained by PCR-RFLP analysis. The marks, Ahdl, BsrBI, Rsal, and 
Seal on the pattern indicate the samples treated with the corresponding restriction enzymes shown. WT denotes a 
DNA sample from a healthy normal person; B. Peutz-Jeghers patient B; FA, Peutz-Jeghers patient FA; MA, Peutz- 
Jeghers patient MA; D, Peutz-Jeghers patient D. MW indicates the molecular weight marker. 
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Figure 6 shows the autophosphorylation activity of LKB1 protein investigated in the presence of various divalent 
cations; the protein was immunoprecipitated. The result shows that the autophosphorylation activity of LKB1 
(kinase activity) is hardly enhanced by Mg 2 * but strongly enhanced by Mn 2+ . 

Figure 7 shows the autophosphorylation activities of the wild-type LKB1 protein and the various mutants thereof. 
5 The top panel displays the autoradiography pattern showing autophosphorylation activity diagnosed by autoradiog- 

raphy; the bottom shows the results of a Western blotting stained with an anti-c-Myc antibody Each protein is 
observed to be to be produced at a similar level. 

Figure 8 shows an electrophoretic pattern of a GST-fusion of LKB1-myc protein expressed in E, coli, which was 
analyzed by Western blotting. M indicates the molecular weight marker; 1 . a iysate of the E. coli cells prior to the 

10 induction by IPTG; 2, a Iysate of the E. coli cells after the induction by IPTG; 3, the GST-fusion of LKB1 -myc protein 
purified with glutathione Sepharose. The filter labeled with "Anti-Myc antibody" was stained using the anti-myc anti- 
body; the one labeled with "Anti-LKB1 antibody" was stained with an affinity-purified anti-LKB1 peptide antibody. 
Figure 9 shows the result of Western blotting using the respective antibodies. Lane 1 contains a sample Iysate from 
COS7 cells with transfected DNA of pcDNA3 vector alone; lane 2, a Iysate from COS7 cells with transfected 

15 pcDNA3/LKB1 myc; lane 3, a Iysate from HeLa S3 ceils. The samples were stained with the antibodies indicated at 
the left and the anti-c-Myc antibody. Peptides used in the preincubation treatment are shown on the top. 
Figure 10 shows a tissue section of human fetal colon stained using anti-LKB1 P3 antibody. The cytoplasm of the 
epithelial cells is positive in the stain. The cells, which are assumed to be endocrine cells, are also stained very 
intensely. 

20 Figure 1 1 shows a tissue section of human adult pancreas stained with the anti-LKB1 P3 antibody. The islet cells 
are positive in this staining. 

Figure 12 shows a tissue section of human fetal testis stained with the anti-LKB1 P3 antibody. The undeveloped 
germ cells were stained intensely. 

25 Best Mode for Carrying out the Invention 

[0037] The present invention will be described further in detail by referring to the examples below, but is not to be 
construed as being restricted thereto. 

30 Example 1 . Construction of a cosmid clone corrtig covering a region on human chromosome 19p1 3.3 

[0038] In order to identify efficiently all the genes located in the region of human chromosome 19p13.3. a contig of 
contiguous clones (contig of arranged clones) assembled with cosmid clones and other clones was constructed. Suit- 
able cosmid clones were selected from a group of clones mapped in this region by FISH, having short and different 

35 overlapping regions; and gaps between the clones were successively filled with clones obtained by screening cosmid, 
phosmid and BAC libraries (cosmid walking, chromosome walking). A high-resolution physical map was, thus, prepared 
by constructing a contig of the contiguous clones covering a 1 .5Mbp region on chromosome 19pter. 
[0039] Specifically, the contig was constructed as described below. A short contig of cosmid clones covering a 
region on chromosome 19p13.3, which was mapped and alligned previously by FISH (Ashworth, L K. et al. An irrte- 

40 grated metric physical map of human chromosome 19. Nature Genet. 1 1 , 422-427 (1995)), was further extended using 
genomic clones obtained from chromosome 19-specific cosmid libraries with Lawrist5 vector as well as Lawristl 6 vec- 
tor, a chromosome 19-specific phosmid library with pBeioBacH vector, and an entire human genome cosmid library 
(Stratagene) as well as an entire human genome BAC library (Research Genetics). The cosmid walking was started 
from the ends of two contigs each containing the PRNT3 gene locus (Zimmer, M. et al. Three human elastase-iike 

45 genes coordinately expressed in the myelomonocyte lineage are organized as a single genetic locus on 19pter. Proc. 
Natl. Acad. Sci. U.S.A. 89, 8215-8219 (1992)) or the GZMM gene locus (Pilat, D. et al. The human Met-ase gene 
(GZMM): structure, sequence and close physical linkage to the serine protease gene cluster on 19p13.3. Genomics, 
24, 445-450 (1994)) on 19p13.3. Specifically, overlapping clones were successively obtained by screening using the 
EcoRI fragments derived from the ends of the contigs as probes; the high-density filters of the cosmid and phosmid 

so libraries were treated with Cot1 DNA (New England Biolabs) for background reduction. In some cases, nucleotide 
sequences of the insert ends were determined directly using the cosmid clones as sequence templates; the primers 
used were Uawrist vector-specific primers, DJ180 (SEQ ID NO: 35/CGACTCACTATAGGGAGACCCA) and DJ181 
(SEQ ID NO: 36/CCTCGAG AATTACCCTCACTAA) . The sequences determined were utilized: (i) to search the data- 
bases for the identification of novel genes exhibiting significant homologies to known genes; (ii) to prepare STS 

55 (Sequence Tagged Site) which is used for further experiments of chromosome walking. Thus, the contig was assem- 
bled from cosmid clones covering a 1.5Mbp region near the telomere on chromosome 19. Restriction sites of BamH\, 
SssHII, EcoRI, H/ndlll, Kpn\ t and Sac I were mapped on the cosmid clones, to prepare a restriction enzyme map of the 
region. More specifically, the map construction was carried out by the following procedures: the cosmid DNAs were lin- 
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earized by treatment with lambda terminase and by partial digestion using restriction enzymes; the cos sites of the 
DNAs were labeled, and subsequently digested with restriction enzymes; the map was made based on the length of 
each restriction fragment (cos-site labeling method; Rackwitz, H. R., et al. Analysis of cosmids using linearization by 
phage lambda terminase. Gene 40, 259-266 (1985)). 

5 

Example 2. Identification of genes mapped on the cosmid clone corrtig on 19p3.3 

[0040] Previously, by radiation hybrid mapping method, 91 clones of EST (Expressed Sequence Tag) have been 
mapped in a region more proximal to the telomere than the genetic marker, D19S216, on 19p3.3. Clones of EST over- 

10 lapping one another were removed by analyzing the sequences using a software program, UNIGENE, 
(ht^y/www.ncbi.nlm.nih.gov/UniGene/index.html), and as a consequence, 60 clones of EST were finally identified to be 
derived from independent genes. Primer sets for amplifying the respective ESTs were purchased from Research 
Genetics; the cosmid clones belonging to the previously constructed contig on 19p3.3 were categorized into four non- 
overlapping groups; then the locations of the respective genes were assigned on the contig by PCR screening. When 

15 an EST gave the positive signal, then the databases were searched for other overlapping ESTs to extend the sequence 
of the EST Complementary DNAs corresponding to the respective positive ESTs were purchased from Research 
Genetics; using the inserts of the cDNAs as probes, Eco Rl-digested cosmid clones were analyzed by Southern hybrid- 
ization, to determine the accurate positions of the respective genes in the contig. 

[0041] "me accurate position of genetic marker D19S886 (GenBank accession number: Z52881) was also deter- 
20 mined in this cosmid clone contig. Specifically, a DNA fragment containing the marker sequence was prepared from the 
human total DNA by PCR amplification using the sense primer (SEQ ID NO: 37/TGGATCTACACTCCGGC) and the 
antisense primer (SEQ ID NO: 38/ATTTTACTGGCTGGC ACTTG) ; the cosmid clone, R32184, was digested with 
Eco Rl, Sacl, BamHI, or Bss HII, and then the digest was subjected to Southern-blot analysis using the DNA fragment 
as a probe, to determine the accurate position of the marker on the Eco Rl map. 
25 [0042] Moreover, mouse genes mapped previously in a region of mouse chromosome 10, which corresponds to 
human 19p, were mapped on this contig by similar procedures. Furthermore, new genes contained in this contig were 
identified by searching databases such as LLNL database [http7Awww-bio.llnl.gov] with the sequences of the cosmid 
ends. 

30 Example 3. Identification of a candidate region containing the Peutz-Jeghers gene 

[0043] Earlier linkage studies have reported that a candidate region for the Peutz-Jegners gene may be located 
more proximally to the telomere than markers D19S878 and D19S565; however there was no reported marker located 
on the telomere side of these markers at all, and accordingly the candidate region was assumed to span over a genomic 

35 region of as long as approximately 2Mb or more on the telomere side of the markers. Nonetheless, there was the pos- 
sibility that the mutant gene was located very near this marker, considering the close linkage of D19S886 marker and 
the Peutz-Jeghers gene locus. Thus, the accurate position of marker D19S886 was first determined in the cosmid con- 
tig as described above, and then the neighboring genes were diagnosed. The result showed that there were 21 genes 
within the region of 400kbp neighboring to marker D19S886. Candidates for the causative gene of Peutz-Jeghers syn- 

40 drome were selected from the newly identified genes, based on a wide variety of biological information such as infor- 
mation obtained by the analyses of the counterpart genes of other species. The samples of chromosomal DNA 
prepared from Peutz-Jeghers patients were diagnosed for nucleotide mutations within the respective genes selected. 

Example 4. Identification of the LKB1 gene and the analysis of genomic organization thereof 

45 

[0044] The GenBank database was searched for the sequences exhibiting homologies to cosmid clone R291 1 4 at 
its end on the telomere side, which was contained in the contig spanning over a region on 19p3.3; the result showed 
that the cosmid end shared a short sequence of 32bp with a cDNA (GenBank accession number: U63333) encoding a 
novel human serine-threonine protein kinase (LKB1). The cosmid clone was directly sequenced using primers, DJ649 

so (SEQ ID NO: 24) corresponding to the 5' end of the LKB1 cDNA sequence and DJ650 (SEQ ID NO: 29) corresponding 
to the 3* end; clone R291 14 and the adjacent clone R26552 contained the LKB1 gene, and thus it was clarified that the 
LKB1 gene was located in this region. For the determination of the entire exon-intron organization of the LKB1 gene 
and for further analyses in detail, PCR primers and sequence primers (shown below; Tables 1 and 2) were designed, 
based on the LKB1 cDNA sequence (1 302bp). Sequences of the respective gene segments were determined by direct 

55 sequencing of the PCR products amplified from the cosmid DNAs. The sequences obtained were compared with the 
LKB1 cDNA sequence to identify positions of the exons and the splice junctions of all the introns. Each exon position 
on the restriction map was determined by comparing the positions of restriction sites in the genomic sequence and by 
comparing the distances, estimated by long distance PCR, between the respective segments of the gene (Figure 1a). 
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Nucleotide sequence was determined by bi-directional sequencing in almost all the regions except for prominently long 
introns 1 and 8. 

[0045] The result showed that the LKB1 gene spanning over 23kb or more was split into 9 coding exons and a non- 
coding exon. Moreover, the gene was presumed to be transcribed toward the direction from the telomere to the centro- 
mere (Figure 2); interestingly, the splice junction of intron 2 was revealed to violate the GT/AG rule characteristic of ordi- 
nary intron-exon junctions. The sequence of intron 2 starts with "ATATCCCTT at the 5' end and terminates with 
"CCCAC"; the TCCTTAAC motif is located 15bp upstream of exon 3. These three sequence elements have close sim- 
ilarities to those of the eukaryotic intron which has been recently reported to be as a very rare intron. Introns with this 
sequence pattern are spliced in a U12 snRNA-dependent manner, which is not the general way of splicing. 

Example 5. Analysis of mutations in patients affected with Peutz-Jeghers syndrome 

[0046] Sequences of the primers used for the mutation analysis of the patients affected with Peutz-Jeghers syn- 
drome are shown in table 1 below. 



Table 1 



DJ705 


5'-GGGAATTCGGAACACAAGGAAG-3' 


(SEQ ID NO: 23/exon 1) 


DJ649 


5'-ATGGAGGTGGTGGACCCGC-3' 


(SEQ ID NO: 24/exon 1) 


DJ659 


5'-GTTACGGCACAAAAATGTCATCCA-3' 


(SEQ ID NO: 25/exon 2) 


DJ666 


5'-GGTGATGGAGTACTGCGTGTG-3' 


(SEQ ID NO: 26/exon 3) 


DJ684 


5 v ACATCGGGAAGGGGAGCTACG-3' 


(SEQ ID NO: 27/exon 6) 


DJ660 


S'-CCGGGCACCGTGAAGTCCTG-S* 


(SEQ ID NO: 28fexon 8) 


DJ650 


5'-TCACTGCTGCTTGCAGGCC-3' 


(SEQ ID NO: 29/exon 9) 


DJ717 


5'-GCAGGCGGCCAGCCTCA-3' 


(SEQ ID NO: 30/exon 9) 



[0047] Mutation analysis was carried out using DNA samples from five unrelated patients affected with Peutz-Jegh- 
ers syndrome (patients A, B, D, MA and FA). The deletion and rearrangement of the gene were screened by long dis- 
tance PCR using the primer sets, DJ659 and DJ660, and DJ666 and DJ660. 

[0048] All the DNA samples derived from the patients and healthy normal persons as controls gave the products of 
3.9kbp in the PCR amplification with the primer set of DJ666 and DJ660. However, not only the 3.9kbp products but 
also the products of 2.5kbp were found in the sample of Peutz-Jeghers patient A (Figure 3b). The shorter DNA fragment 
was subcloned, sequenced and then compared with the chromosomal DNA sequence of the normal counterpart: the 
result showed that the LKB1 gene was complicatedly rearranged in the chromosome of this patient (Figure 1 b). Specif- 
ically, the PCR products of 2.5kbp had two deletion mutations; one is a deletion of a 1286bp segment located between 
intron 3 and intron 5, and the other is a deletion of an 81 bp segment located inside intron 7. The longer deletion con- 
tains exons 4 and 5. In addition to this, an inversion was found in the central region containing exons 6 and 7. The exist- 
ence of the inversion was confirmed by PCR analysis using sense primers, DJ666 and DJ684, which were specific to 
exon 3 and exon 6, respectively. Both of these primers correspond to the sense chain, and therefore PCR products are 
given only when the inversion exists. The analysis has revealed that the inversion is carried by all the affected family 
members, including patient A, for two generations. On the other hand, this mutation was found neither in the healthy 
members of the same family nor in unrelated healthy normal persons used as controls (Figures 3a and 3c). 
[0049] The complicated LKB1 mutation was analyzed at the level of transcripts by RT-PCR using RNA prepared 
from the cells of peripheral blood of patient A. The RNA was isolated using an RNA Isolation Kit (Qiagen) and reverse- 
transcribed into cDNA using Superscript (GIBCO-BRL, Life Technologies). Two distinct PCR products of 730bp and 
270bp were given by PCR using a primer set of DJ660 and DJ666. There is a length difference of about 460bp between 
the two types of the PCR products; the length agrees well with the length between exon 4 to exon 7 in the transcript, 
suggesting the production of the transcript from which the segment of exons 4 to 7 has been removed by splicing, and 
which was presumed to be caused by the complicated mutation. The aberrant splicing event, in which exon 3 directly 
links to exon 8, does not result in frameshift, but the resulting transcript is assumed to encode an aberrant protein con- 
sisting of 281 amino acids in which the C-terminal half of the catalytic domain is missing. 

[0050] The aberrant PCR products were not observed in any of patients B, D, MA, and FA. Eight sets of primers 
were designed for the screening of point mutations in the LKB1 gene (Table 2); the respective exons were amplified 
independently and then directly sequenced (Figure 4). 
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Table 2 





DJ698 


5'-GGTCCCCGAGGACGAAGTTGA-3' 




5 


DJ673 


5*-ACCATCAGCACCGTGACTGG-3' 






DJ703 


S'-TCGCCGGCCGATGACAGA-S' 






DJ674 


5'-AAGGAGACGGGAAGAGGAGCAG-3* 


fSEO ID NO-10/pxnn 2 anti<;pn<;p^ 


70 


DJ690 


5'-GAGGAGGGG C AAGGTGGGT-3 1 


fSEO ID NO-1 1/pxr»n ^ <;pn<;p^ 




DJ680 


S'-GTGTGGCCTCACGGAAAGGAG-S' 


fRFO ID NO"1P/pvnn p nti qpmqp^ 




DJ692 


5'-AGCTGGGCCTGTGGTGTTTG-3 , 




75 


DJ694 


5'-CAGAGGCCCCTCGGAGTGTG-3' 


(SEQ ID No:14/exon 4*5 antisense) 


DJ695 


5M3CCTCTGTCCCTGGGGTAGA-3* 


(SEQ ID NO:15/exon 6 sense) 




DJ693 


5'-TCAGTCCTCTCAATGCCTGCTG-3' 


(SEQ ID NO:l6/exon 6 antisense) 




DJ696 


5'-GCGGGGTCCCCCTTAGGAG-3' 


(SEQ ID NO:1 7/exon 7 sense) \ 


20 


DJ697 


5'-CTAGCGCCCGCTCAACAG-3' 


(SEQ ID NO:18/exon 7 antisense) 




DJ675 


5'-GGAGCTGGGTCGGAAAACTGGA-3' 


(SEQ ID No:19/exon 8 sense) 




DJ702 


5'-TGCTCCCGTGGGACATCCTG-3' 


(SEQ ID NO:20/exon 8 antisense) 


25 


DJ676 


5'-GTAAGTGCGTCCCCGTGGTG-3' 


(SEQ ID NO:21/exon 9 sense) 


DJ677 


5'-GTGGCATCCAGGCGTTGTCC-3' 


(SEQ ID No:22/exon 9 antisense) 



[0051] Since the intron between exon 4 and exon 5 is short, the exons were amplified together at the same time. 
The result showed that all the patients carry mutations in one of the two alleles of the LKB gene. The result is shown in 

30 Table 3. Codon numbers in the table indicate the last wild-type codon. Restriction enzyme sites were diagnosed for the 
presence of mutations in the PCR products of the Peutz^Jeghers patients as well as healthy normal persons. The sym- 
bols, + and -, accompanying the names of restriction enzyme indicate the presence and absence of the site, respec- 
tively. Nucleotides of LKB1 genes derived from patients B, MA and FA have been numbered on referring to nucleotide 
numbers assigned in the LKB1 cDNA sequence deposited in GenBank (accession number: U63333). Nucleotide num- 

35 bers used in the LKB1 sequence from patient D are indicated on referring to those of the LKB1 gene sequence depos- 
ited in GenBank (accession number: AF032985). 



Table 3 



40 


Patient 


Mutation type 


Mutation position 


Codon 


Restriction site 


Possible influence 




A 


inversion/deletion 


exon 4-7 


155 




• deletion of codons 156- 
307 


45 


D 


G2412A 


intron 3 


156 


+Scal 


* deletion of splice accep- 
tor site 

• exon 4 skipping; 
frameshift 


50 












• premature termination at 
codon 242 




B 


deletion of GGTC at 
nucleotide position 716 


exon 5 


240 


-Ahdl 


* premature termination at 
codon 285; frameshift 


55 


MA 


C759A 


exon 6 


252 


-Rsal 


• Tyr253 (TAC)-> Stop 
codon (TAA) 




FA 


deletion of G at nucleotide 
position 843 


exon 6 


280 


-BsrBI 


• frameshift; premature ter- 
mination at codon 286 
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[0052] In patient MA, C is substituted for A at nucleotide position 759 (in exon 6), and as a result. TAC codon of tyro- 
sine is altered to a stop codon, TAA, at amino acid position 253. It was revealed that, in patient B, four base pairs from 
nucleotide position 717 to 720 (in exon 5) were deleted; the deletion results in the generation of a stop codon at the 
position 1 35bp downstream from the codon for tryptophan at amino acid position 239. In patient D, the di-nucleotide AG 
5 to be conserved strictly at the splice acceptor site located at the 3'end of intron 3 was found to be altered to AA. When 
exon 4 is skipped and as a result exon 3 links to exon 5, a frameshift occurs immediately behind amino acid position 
155 and the protein sequence terminates at amino acid 241 . It was found that, in patient FA, nucleotide G at nucleotide 
position 843 (exon 6) was deleted, which caused a frameshift and the generation of a stop codon, TGA, at nucleotide 
857. 

10 [0053] All the mutations found in the LKB1 gene of the five patients affected with Peutz-Jeghers syndrome result in 
loss of the indispensable part in the kinase domain, and as a consequence the conformation is presumed to be 
destroyed in the mutant LKB1 proteins. In addition, the C-terminal amino acids, which were assumed to constitute the 
regulation domain in LKB1 protein, were missing in the four of these patients. These mutations in the LKB1 gene were 
considered to give rise to the loss of kinase activity in the LKB1 proteins as mutant gene products, and the signal trans- 

15 duction pathway in which LKB1 was involved was blocked as a consequence. 

[0054] The genomic organization of the LKB1 gene was similarly investigated in a Japanese patient affected with 
polyposis who was suspected of Peutz-Jeghers syndrome (SK1). The genomic DNA was extracted from the peripheral 
blood of the patient using a QIAamp Blood kit (Qiagen). and the LKB1 gene was amplified by PCR using the DNA as a 
template and using the primers indicated in Table 2. The result of direct sequencing analysis of the PCR products 

20 showed that there was a single-base (C) deletion (the C is the third nucleotide of codon "GAC corresponding to Asp at 
amino acid position 207) in the coding region within exon 5. The mutation results in a frameshift on the C-terminal side 
of Asp 207, which perhaps generates a truncated protein without the kinase activity as a result. 
[0055] The PCR amplification of the LKB1 gene derived from the DNA sample was performed basically under the 
following conditions: the reaction solution, of which total volume was 50 ul, contained 100 ng of chromosomal DNA 

25 (template DNA), 50pmol of primer, PCR buffer J (Invitrogen), 2.5 units of AmpliTaq (Perkin Elmer), and 2.5 ul of DMSO, 
or the solution contained 1 00 ng of chromosomal DNA (template DNA), 20pmol of primer, 5 ul of 1 0x TaKaRa Tag buffer 
(TaKaRa), 4 ul of 2.5mM dNTPs (TaKaRa), 0.4 ul of TaKaRa Tag DNA polymerase (TaKaRa), and 0.4 ul of Taqstart™ 
Antibody (CLONTECH); the reaction profile was: pre-heat at 94°C for 2 to 4 minutes; 35 cycles of denaturation at 94°C 
for 30 to 45 seconds, annealing at 58°C or 62°C for 30 seconds and extension at 72°C for 45 seconds; and the final 

30 extension at 72°C for 3 minutes. The resulting PCR products were purified using a QIAquick Gel Extraction kit (Qiagen) 
after agarose gel electrophoresis, or purified from the PCR reaction solution using a QIAquick Nucleotide Removal kit 
(Qiagen); using the purified PCR products as a template, direct sequencing was performed bidirectionally. A Dr hod am- 
ine terminator cycle sequencing kit (Applied Biosystems) was used in the sequencing. 

[0056] In order to verify that the mutations found were not caused by artifacts in PCR amplification or in sequencing, 
35 DNAs covering the regions containing the mutations were amplified by PCR under the conditions described above, the 
resulting PCR products were digested with restriction enzymes selected for the respective mutations, and then the 
lengths of the fragments obtained by the digestion were diagnosed for the mutation sites (PCR-RFLP analysis). Specif- 
ically, a 20 ul reaction solution consisting of 5 ul of PCR products, 4 units of restriction enzyme (AMI, BsrS I, Rsa\ or 
seal) and 2 ul of restriction enzyme buffer was incubated at 37°C for 1 .5 hours; the resulting lengths of the DNA frag- 
40 ments were analyzed by agarose electrophoresis. 

[0057] When the DNA samples from 50 healthy persons were diagnosed by the same analysis, the mutations were 
not observed in these samples at all; thus the results showed that the mutations were specific to patients with Peutz- 
Jeghers syndrome (Figure 5). 

45 Example 6. Construction of plasmid DNAs for the expression of the LKB1 gene in E. coii and mammalian cells 

[0058] A DNA fragment encoding the entire LKB1 amino acid sequence with a c-Myc epitope sequence (SEQ ID 
NO: 41/Glu Gin Lys Leu lie Ser Glu Glu Asp Leu) as a C-terminal tag was amplified by PCR using the LKB1 cDNA clone 
as a template and using LK E1 primer (SEQ ID NO: 39/5'-gat gaa ttc ggg tec age atg gag gtg gtg gac S') and LK E2 

so primer (SEQ ID NO: 40/5'-gat gaa ttc tta gag gtc ttc ttc tga gat gag ctt ctg etc ctg ctg ctt gca ggc cga-3') ; the PCR prod- 
ucts were digested at the Eco Rl sites located within the primer sequences; the Eco Rl fragment was inserted into 
pcDNA3 vector (Invitrogen) at the Eco Rl site for the expression in mammalian cells or into pGEX-5X-1 vector (Pharma- 
cia) at the site for the expression in E. coli. The cloned DNAS were diagnosed by sequencing not to have any sequence 
variations as artifacts caused by PCR amplification; and such clones were selected and used for expression experi- 

55 ments. 

[0059] in vitro mutagenesis was carried out using pcDNA3/LKB1myc as a template and using GeneEditor™ 
(Promega), to prepare expression plasmid DNAs with the mutations corresponding to the amino acid substitutions 
found in the Peutz-Jeghers patients. Specifically, the expression plasmid DNA (pcDNA3/LKB1 D176Nmyc) for D176N 
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mutation (the abbreviation means the substitution mutation from Asp to Asn at amino acid 1 76; the mutations shown 
below are also abbreviated in the same way) was constructed by introducing the mutation using LK D1 76N primer (SEQ 
ID NO: 45/5'-att gtg cac aag aac ate aag ccg ggg-3 1 ); the plasmid for mutation W308C (pcDNA3/LKB1 W308Cmyc), LK 
W308C primer (SEQ ID NO: 46/5'-cgg cag cac age tgc ttc egg aag aaa-3*); the plasmid for L67P mutation 

5 (pcDNA3/LKB1 L67Pmyc), LK L67P primer (SEQ ID NO: 47/5'-gtg aag gag gtg ccg gac teg gag acg-3 1 ); plasmid DNA 
for K781 mutation (pcDNA3/LKB1 K781 myc). LK KM primer (SEQ ID NO: 48/5-agg agg gec gtc ate ate etc aag aag-3') 
The DNA strand was newly synthesized by annealing the single-stranded template DNA of the plasmid with two prim- 
ers; one was the primer for the introduction of the mutation, and the other was a selection primer (for bottom strand) 
appended to the kit. The prepared DNA was introduced into E. coli cells; clones resistant to the antibiotic, Gene Editor 

-10 TM, were selected to obtain the cells containing the plasmids with the mutation. The plasmids, which were verified to 
have the mutation by sequencing, were used for expression experiments. 

[0060] The C-terminal end of LKB1 protein can be tagged with an epitope peptide (SEQ ID NO: 44/NH 2 -YPYDVP- 
DYASL-COOH) recognized by anti-HA antibody or tagged with a sequence of consecutive 6 histidine (H) residues (his- 
tidine hexamer), by introducing, into a variety of vectors, the DNA fragments prepared by PCR using LK E4 primer (SEQ 
is ID NO: 42/5'-gat ggg ccc tta cag gga ggc ata gtc agg cac ate ata tgg gta ctg ctg ctt gca ggc cga-3') or LK E5 primer 
(SEQ ID NO: 43/5'-gat gaa ttc tta gtg atg gtg atg gtg atg ctg ctg ctt gca ggc cga-3*), respectively, instead of LK E2 primer. 
By utilizing the tagged constructions, LKB1 protein can be detected by the methods using with anti-HA antibody or anti- 
histidine hexamer-antibody, or can be purified by an affinity-purification method with anti-HA antibody or the nickel col- 
umn. 

20 

Example 7. Gene expression of LKB1 in mammalian cell and the kinase activity assay 

[0061] About 10 jig of the expression plasmid DNAs, including the plasmid DNA prepared for LKB1 expression 
(pcDNA3/LKB1 myc), was introduced into COS7 cells (by transfection) by the method using SuperFect (Qiagen). Spe- 

25 cifically, after about 10 6 COS7 cells were placed in a 10cm dish and cultured overnight, a mixture consisting of 10 ^g 
of plasmid DNA and 60 \j\ of SuperFect was added thereto and then cultured for about 3 hours. Subsequently, the cells 
were further cultured in a fresh medium for 1 to 2 days, and then harvested by using a trypsin- EDTA solution. The cells 
were suspended in an NP40 kinase lysis buffer (10mM Tris-hydrochloride (pH7.8), 1% NP40, 0.1 5M sodium chloride, 
1mM EDTA, 50mM sodium fluoride. 5mM sodium pyrophosphate, 10 ug/ml aprotinin, and ImM PMSF); the proteins 

30 were solubilized by stirring the suspension at 4°C for 30 minutes. Protein A/G plus agarose (Santa Cruz) was added to 
the cell lysate obtained, and the mixture was stirred for 30 minutes to remove non-specific proteins adsorbed by the 
beads. Anti-c-Myc antibody A14 (Santa Cruz) was added to the mixture and then the mixture was left to stand at 4°C 
for 1 hour; protein A/G plus agarose (Santa Cruz) was added thereto, and the mixture was allowed to stand for another 
1 hour. The resultant immune-complex was precipitated from the mixture by centrifugation, washed several times with 

35 the NP40 kinase lysis buffer, a buffer containing 1 M sodium chloride and a solution of 50mM Tris-hydrochloride (pH7.8); 
the precipitate was used to assay the kinase activity. 

[0062] The kinase assay was performed in the reaction system of a 50jil solution containing 50mM Tris-hydrochlo- 
ride (pH7.8), 1mM DTT, 10mM divalent cation (Mn, etc.), and 10 nCi of [y- 3Z P] ATP. The immunoprecipitate was incu- 
bated in the kinase assay solution at 37°C for 30 minutes and then the reaction was stopped by adding the SDS-PAGE 
40 sample buffer and boiling; the sample was subjected to SDS-PAGE. The gel was fixed with a methanol/acetic acid solu- 
tion, dried and then analyzed by a BAS200 image-analyzer (Fuji Rim). The kinase activity of LKB1 assayed was evalu- 
ated as the autophosphorylation activity. The protein expressed was detected by SDS-PAGE followed by Western 
blotting using anti-c-Myc antibody A14. 

[0063] The divalent cation-dependency in the kinase activity was first diagnosed, and the result showed that the 
45 kinase activity was hardly enhanced by Mg 2+ but enhanced intensely by Mn 2+ (Figure 6). The kinase activity was diag- 
nosed in the mutant proteins containing amino acids substituted in the Peutz-Jeghers patients; none of the mutant pro- 
teins exhibited the kinase activity (Figure 7). These findings strongly suggest that LKB1 kinase loses its activity in 
Peutz-Jeghers patients and which becomes the cause of the disease. 

so Example 8. Expression of LKB1 protein in E. coli 

[0064] Plasmid pGEX/LKB1 myc was introduced into E. coli strain DH5 a, and a single-colony was selected. The E. 
coli cells were cultured in 10 ml of 2xYT medium at 37°C overnight; an aliquot taken from the culture was diluted 100 
times by fresh 2xYT medium; the E. coli cells were further cultured at 37°C until the OD value of the medium measured 
55 at 600 nm reached 0.6. IPTG (isopropyl p-D(-)-thiogalactopyranoside) was then added to the culture at a final concen- 
tration of 0.1 mM, and the culture was prolonged for another several hours. The bacterial cells were collected by centrif- 
ugation, suspended in PBS containing 1% TritonX-100 and 1% Tween 20 and lysed by sonication to solubilize their 
proteins. LKB1 protein expressed as a fusion protein with glutathione-S-transf erase (GST) was purified from the solu- 
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bilized sample by an affinity purification method using glutathione Sepharose 4B (Pharmacia). The LKB1 protein 
expressed in E. coli was also detectable by Western blotting using anti-c-Myc antibody A14 (Figure 8). 

Example 9. Preparation of antibody against LKB1 protein and the use thereof 

5 

[0065] Two peptides were synthesized (Sawady Technology) based on the amino acid sequences of the N-terminal 
end and the C-terminal end of LKB1 protein, and the sequences are as follows: NH 2 -CHRIDSTEVIYQPRRKRAKL- 
COOH (SEQ ID NO: 34) of LKB1 P6 peptide (amino acid 27-45) and NH 2 -CLSTKSRAEGRAPNPARKA-COOH (SEQ 
ID NO: 31) of LKB1 P3 peptide (amino acid 400-417). The respective peptides were conjugated at the N-terminal 
10 cysteine with keyhole limpet hemocyanin (KLH) by a method using m-maleimidebenzoyl-N-hydroxysuccinimide ester 
(MBS) (Sawady Technology). The conjugated peptides were given to rabbits several times as antigens for immuniza- 
tion. The specific antibodies reacting to these peptides were purified from the antiserum by affinity columns prepared 
using cellulofine (Seikagaku Co.) to which the peptides were (inked. 

[0066] The results of Western blotting with these antibodies (anti-LKB1 P6 antibody and anti-LKB1 P3 antibody rec- 
15 ognize the N-terminal and C-terminal ends of LKB1 protein, respectively) are shown in Figure 9. The results showed 
that the band (indicated by arrow) of about 55 kDa corresponding to the LKB1 protein expressed in COS7 cells was 
delectable by each antibody. Because the positive immunoreactton was blocked when the antibody was pre-incubated 
with a large excess of the corresponding peptide antigen, the reaction was considered to be specific to the epitope. In 
addition, no cross-reaction was seen with the lysate from HeLa S3 cells; the cells express no LKB1 protein; and thus it 
20 is safe to conclude that LKB1 protein can be detected with high specificity by these antibodies. It was also shown that 
the LKB1 protein expressed as a GST fusion protein in E. coli was detectable with anti-LKB1 P3 antibody (Figure 8). 
[0067] The results of immuno-staining of human tissue sections with the antibodies are shown in Figures 10, 1 1, 
and 12. Figure 10 shows an immu no-stained tissue section of human fetal colon. The cytoplasm of the epithelial cells 
was positively stained. The cells, which are assumed to be endocrine cells, are also stained very intensely. Figure 1 1 
25 shows an immu no-stained tissue section of human adult pancreas. The islet cells are stained positively. Figure 12 
shows an immuno-stained tissue section of human fetal testis. The undeveloped germ cells are stained intensely. The 
immuno-staining patterns shown were obtained with anti-LKBI P3 antibody; similar results were also obtained with 
anti-LKB1 P6 antibody. The positive immunoreactions shown were blocked when the antibody was pre-incubated with 
the corresponding peptide antigen, so that the staining was considered to be specific. 

30 

Industrial Applicability 

[0068] The present invention revealed that Peutz-Jeghers syndrome was caused by the mutations in the LKB1 
gene. The invention made it possible to diagnose and treat the diseases caused by the mutations in the LKB1 gene, 
35 such as Peutz-Jeghers syndrome, by utilizing the LKB1 gene, primers or probes designed based on the sequence 
thereof, LKB1 protein and the antibodies thereto. 
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SEQUENCE LISTING 

<110> CHOGAI RESEARCH INSTITUTE FOR MOLECULAR MEDICINE, INC. 

<120> DIAGNOSTIC METHOD, DIAGNOSTIC REAGENT AND THERAPEUTIC PREPARATION 
FOR DISEASES CAUSED BY VARIATION IN LEB1 GENE 

<130> C2-909DP1PCT 

<150> JP 1997-344256 
JP 1998-280357 

<151> 1997-11-27 
1998-10-01 

<1B0> 48 

<210> 1 
<211> 3765 
<212> DNA 
<213> Homo sapiens 

<400> 1 

gcccactgct ttatgcggcc gtttcatttg atgtaatccg agcccggtgc gaagccgaag 60 

caggtggatc acctgaggtt aggagttcgg gaccagcctg aacaatatgg tgaaaccctg 120 

tctctattga aaatccaaaa attatccagc catggtggtg tatgcctgta gtcccagcta 180 

ctccagaggc tgagacagga gaattgcttg aatctgggaa gcagaggttg cagtgaacca 240 

agatcatgcc attgcactcc agcccgggca acagagcgag actccgtctc aaaaaaaaaa 300 

aaagagagtc agtgtggagg gaagtgtctc tgttgggtcc gggtgctctg tgggactctg 360 

aggaaaagct cgcaccaggt agatgctgtt ctgtccccat gggtaaagga caccccaaca 420 

aaccgaggag caaaatgtcc tcgtatgccc tcttcgcgca gacctgccag gaagagcaca 480 

agaagaaaca cccggactct tctgtcaatt tcgtggaatt ctccaagaag tgtttggaga 540 

gatggaagac cacgtctgca aaggagaagt gaagtttgaa gagaaggcaa aaagtgacaa 600 
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aori"*i"f»«rf»+tf+ frApficror<rnfp& +i"JU*J*A5\f"flfl PJlttppi'PPP' 
d£LLLgLLgL 5at<l555(i6a ui«aaaaatui vatu^u^tLg 


aaciLg Uiaga 




UV/U 


5 


aagga&ag&a aaaggaLLgC act LgC LL C La gaaggCCaCC 


O 4" A f" f7A a44/» 

aLCLgCC LLC 


L LC C LgX L L L 


1 CU 




gctctgaaca tcgcccaaag atcaaaagtg gacacccagg 


aa4»a4>4»4»«*4-a 
CCLatLLgXC 


gtggaaactg 


roU 


10 


caaagaaact gggxgaaatg tggtc tgggc agtcagccaa 


agataaacaa 


ccatatgagc 


o4U 




agaaagcagt taagctacag gagagatatg aaaagggtat 


tgetgeatat 


cgtgctaagg 


900 


15 


gcaaaagtga agcaggaaag aagggctcaa agaagaacaa 


accagaagat 


gaggaggagg 


n<* a 

960 




aggaggagaa agaagatgaa gatgaggagg aagagggtga 


agatgaagaa 


taaatggcta 


1 AO A 

1020 


20 


tcctttaatg atgcctgtgc a*jtgggcttg ttttgctaag 


1 1 1 1 

aatgtgaatt 


ctagtacagc 


4 no a 

1080 




tcagtattag cttcagtata aaactgtaca aattttcgta 


tagctcataa 


gattctctgt 


1 4 if A 

1140 




X — AAiiiAiiM ^4-4-4-4- .1X11 4 f f f J»fLf»JL Jlfl 

acagaaaata cttttctttt cttttctttt ttttgagaca 


gagtttcget 


cttgttgcct 


1 OAA 
1ZU0 


25 


aggctggagt gcaatggcgt gatctcggct caccgcaacc 


tccgcctccc 


gggtcctggt 


i OCA 

IZbO 




tcaagcagtt ctcctgcctc agcctcctga gtagctggga 


ttacaggcac 


atgccaccac 


lo^U 


30 


gcucagctaa tutttgtaxt ttuagxagag aiggggTxic 


accatgXLgg 


ccaggcxgcx 


1 **qa 




44~A<kooA4*AA -4" flr^ a a 4" a f*4r ft of aa cta /ff itp + t aopifaa4" a a 

LLcaaacxcc tgacc LCgxg atccgccLgc xtcggcc tec 


caaaaLacLg 


<r<raf 1 0 a aver 

gga l uac agg 




35 


Lgxgagccac cgcacccLgc CLaatgrccc caaaLaLLLa 


a+crcrttt 
aLggXLLLLa 


a 0 a 0 -a 4"+ 4" q + 

aaaaaLLLax 






4 it 4 «» 4 0 4- ctcrr* qit/>o AQif aaa a a 4" 4" c^4" 0 t^tfro a 4" 4 0 or4" q 4 a 0 

LgLgLaLggC agcacagCaC aCLLgxagga aLLagLatCa 


ac ag utc a lc 


L LgCgLL L L L 


100U 


40 


f qq era \ ota 4" <r AQf+t^ffaa /ia4"44+irf oa f'aaaa'f tat'if 
LdogdlgCLg Cdllllllaa LallllgUIa LaaaaL LcLLg 


eg LaLLaaaa 


O si a /* B a 9 ITS A 
il£U*L CXCL^^nvltl 


lULU 




attccgtgtg tagttcacac tcacagcaca tctccgtcca 


ggcacttgag 


agaatgacta 


1680 


45 


ggaggggttc ttggaggagg tggtctttga aeggagaate 


catcttcaag 


gattctgtct 


1740 




gtaatggtca ccaagtattt cctgagtcac ttccatgtgt 


cctgcagttc 


tctgaagggg 


1800 


50 


cgtgggacct accgatgcca attatccagc attatctcca 


gattccaaga 


a^ttggggtg 


1860 




tgagecagea atcagtacag aaaagagata ccaaaataag 


tttgagttgg 


ggagtgttcc 


1920 



55 
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ttcaacttca gttttctgga agagatcttt tttttttttt 


tgagacagag 


tttegctett 


1980 


5 


attgcccaag ctggagtgca gtggcacgat ctcggctcac 


cgcaacctcc 


tcctcccggg 


2040 




ttcaagcgat tctcctccct cagccttctg agtagctggg 


attacagaca 


tgcacctgta 


2100 


10 


atttctacta aaaatacaaa aattagccgg gcgtggtggc 


gcacgcctgt 


aatctcagct 


2160 




actggggagg ctgaggcagg agaatcgctt gaaaccagga 


ggeggagatt 


gtaccaagat 


2220 


15 


agtttgttcc agctaaacaa cctggcgcta gtgcaggaaa 


aggtggaagg 


caeggggcta 


2280 




gcacaggagg gttcaatatt ttcaacctta tcaagccata 


ttttggcaac 


tcttgttttt 


2340 


20 


cacgagaagc ccccgctggg cttgtcccag cgctgtcctg 


aggcttcccc 


catgagttcc 


2400 




gatagggcag aggccgccct gagcgtttct ctttcccctg 


gtccaagagt 


ggctcaaaag 


2460 


25 


aaggattttt gactggaatt ggccactttg tgttactttt 


tgacccttga 


cctcgcccca 


2520 




aagggggatg cgggggaggg gctctggtag gggtggcccc 


gctccttcca 


ggtccgcaag 


2580 


30 


cccaggttcc cgcccaccgg gctcagccca ccctgcggcc 


gttcagggag 


gccgttggca 


2640 




cccgtgacct acgaccccct tcccgagccc caccgaggtc 


acagccgtgg 


cctcgtctcc 


2700 


35 


ccatgcctgc ttcccgcccc ctgcccgtga cgggcgtctc 


cgaggaccaa 


tgagegeget 


2760 




gtatccaccc ctcgggcggg gccaagcgcc gaccaatcgc 


cgctcgggcg 


cccggccggg 


2820 


40 


tccaaacgct ccaatcgtca gcggcggcgg ggcgggcaga 


gggccgggga 


tggcaggttc 


2880 




aalOadCggg M&gCaCgXC gLCULCgUga ggaSgCgLgC 


#» f* +• tri* tf tr /» r* cr 
CC UgCggCCg 


trim trY 9t* w\ 




45 


gtccgcggcg gegcagggag ggggagggag gtaaacaaga 


tggeggegge 


gtgtcgggCg 


3000 




eggaaggggg aggcggcccg gggcgcccgc gagtgaggcg 


eggggeggeg 


aagggagege 


3060 


50 


gggtggcggc acttgctgcc gcggccttgg atgggctggg 


cccccctcgc 


cgctccgcct 


3120 




cctccacacg cgcggcggcc gcggcgaggg ggacgcgccg 


cccggggccc 


ggcaccttcg 


3180 



55 
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ggaacccccc ggcccggagc ctgcggcctg 


cgccgcctcg gccgccggga 


gccccgtgga 


3240 


5 


gcccccgccg ccgcgccgcc ccgcggaccg 


gacgctgagg gcactcgggg 


cggggcgcgc 


3300 




gctcgggcag acgtttgcgg ggaggggggc 


gcctgccggg ccccggcgac 


caccttgggg 


3360 


10 


gtcgcgggcc ggctcggggg gcgcccagtg 


cgggccctcg cgggcgccgg 


gcagcgacca 


3420 




gccctgagcg gagctgttgg ccgcggcggg 


aggcctcccg gacgccccca 


gccccccgaa 


3480 


15 


cgctcgcccg ggccggcggg agtcggcgcc 


ccccgggagg tccgctcggt 


cgtccgcggc 


3540 




ggagcgtttg ctcctgggac aggcggtggg 


accggggcgt cgccggagac 


gcccccagcg 


3605 


20 


aagttgggct ctccaggtgt gggggtcccg 


gggggtagcg acgtcgcgga 


cccggcctgt 


3660 


gggatgggcg gcccggagaa gactgcgctc 


ggccgtgttc atacttgtcc 


gtgggcctga 


3720 


25 
30 


ggtccccgga ggatgaccta gcactgaaaa 


gccccggccg gcctc 




3765 


<210> 2 
<211> 1006 
<212> DNA 
<213> Hobo sapiens 








35 


<220> 
<221> exon 
<222> (1)..<628) 








40 


<220> 

<Z21> mtron 
<222> (629)-. (1006) 








<400> 2 

cccagggtcc ccgaggacga agttgaccct 


gaccgggccg tctcccagtt 


ctgaggcccg 


60 


45 


ggtcccactg gaactcgcgt ctgagccgcc 


gtcccggacc cccggtgccc 


gccggtccgc 


120 




agaccctgca ccgggcttgg actcgcagcc 


gggactgacg tgtagaacaa 


tcgtttctgt 


180 


50 


tggaagaagg gtttttccct tccttttggg 


gtttttgttg cctttttttt 


ttcttttttc 


240 




tttgtaaaat tttggagaag ggaagtcgga 


acacaaggaa ggaccgctca 


cccgcggact 


300 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



cagggctggc ggcgggactc caggaccctg ggtccagc atg gag gtg gtg gac ccg 356 

Met Glu Val Val Asp Pro 
1 5 

cag cag ctg ggc atg ttc acg gag ggc gag ctg atg teg gtg ggt atg 404 
Gin Gin Leu Gly Met Phe Thr Glu Gly Glu Leu Met Ser Val Gly Met 
10 15 20 

gac acg ttc ate cac cgc ate gac tec acc gag gtc ate tac cag ccg 452 
Asp Thr Phe He His Arg He Asp Ser Thr Glu Val He Tyr Gin Pro 
25 30 35 

cgc cgc aag egg gec aag etc ate ggc aag tac ctg atg ggg gac ctg 500 
Arg Arg Lys Arg Ala Lys Leu He Gly Lys Tyr Leu Met Gly Asp Leu 
40 45 50 

ctg ggg gaa ggc tct tac ggc aag gtg aag gag gtg ctg gac teg gag 548 
Leu Gly Glu Gly Ser Tyr Gly Lys Val Lys Glu Val Leu Asp Ser Glu 
55 60 65 70 

acg ctg tgc agg agg gec gtc aag ate etc aag aag aag aag ttg cga 596 
Thr Leu Cys Arg Arg Ala Val Lys lie Leu Lys Lys Lys Lys Leu Arg 
75 80 85 

agg ate ccc aac ggg gag gec aac gtg aag aa gtaagtatggc ttgctggggt 649 
Arg lie Pro Asn Gly Glu Ala Asn Val Lys Lys 
90 95 

eggggceggg ccgggccagt cacggtgctg atggttctgt cttccttcct tctctcctcc 709 

ctccctccct tacttcctct taacaccctg agctggaccc gtctggcgcc tgtgtcctcc 769 

gtgccaggga gagcgtggtt gggggcctgc gttaeggact ttcactcagg caaggecagt 829 

tgtcgcagcg gggcgtgcgt ttgcatgggc tcttggactc cagttaaaat gecctggtag 889 

cgaaaccctc ctgagaaggg agcggccccc aatcccctaa gactagcccc ttggctcccc 949 

cagctgtcca aggagcagag gcgcccagtg gaatcagect gtgtttgttt gggcccc 1006 

<210> 3 
<211> 5661 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> intron 
<222> (1)..(456) 

<220> 

<221> exoa 

<222> (457).. (540) 

<220> 

<221> intron 
<222> (541). .(1363) 

<220> 

<221> exon 

<222> (1364).. (1453) 

<220> 

<221> intron 

<222> (1454).. (2412) 

<220> 

<221> exon 

<222> (2413).. (2545) 

<220> 

<221> intron 

<222> (2546).. (2620) 

<220> 

<221> exon 

<222> (2621).. (2757) 

<220> 

<221> intron 

<222> (2758).. (3252) 

<220> 

<221> exon 

<222> (3253).. (3380) 



21 





PP 1 fLIfi 844 A1 




5 


<220> 

<221> intron 

<222> (3381).. (3988) 




10 


<220> 

✓991 > pyrin 

<222> (3989).. (4046) 




15 


<220> 

*»clw miron 

<222> (4047).. (5024) 




20 


<220> 

^uLW exon 

<222> (5025).. (5212) 




25 


<221> intron 




ggtgaaaccc tgtctgtact aaaatacaaa aaaattagcc ggctgtggtg acacgtgcct 


60 


30 


gtagtcccag ctactcggga ggctgaggca gggaaatcac ttgaacctgg gaggcagagg 


120 




ttgcagtgag ctgagatcgg gccaccgtgc tccagcctag ggacagagca agactctgtc 


180 


35 


tcagaaaaaa aaaaaagtgt cctaactgtg tcctccaaag ccctcgccgg ccgatgacag 


240 




actagagggc gctgtgctcc caccccctac cgccctgagc ctggacgcgt ggcccctgca 


300 


40 


gggccctttc ccacagcact gtgaactcac agcttctctc tagggaaggg aggaggtacg 


360 




ccacttccac agggagatgg ggaggccgac tccagggatc caggccatca tcctgacgtt 


420 


45 


gggtcggctg atacacccct gtcctctctg tcccag g gaa att caa eta ctg agg 

Glu He Gin Lea Leu Arg 
100 


475 


50 


agg tta egg cac aaa aat gtc ate cag ctg gtg gat gtg tta tac aac 
Arg Leu Arg His Lys Asn Val He Gin Leu Val Asp Val Leu Tyr Asn 
105 110 115 


523 
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gaa gag aag cag aaa at atatcctttcc ggtgttggga ccgcggggcc 571 
Glu Glu Lys Gin Lys Met 
120 125 



4-np rf\~ wmhm tT 

xccgxgggag 


gggCXggggC 


CC W»5|>XCCg 








001 


xccxxgaagg 


agaCtggCaC 


acgagggccg 


xggcctxccc 


xggxxccccg 


era a «y 4 natf aa 

gaagxcagcc 




attgtggcaa 


xggctgcgca 


l*A 4" T A T A A 

gcxxgcxgaa 


aggggcccxg 


A A> A A A T #TA* A A 

agcccxggcc 


A A T — T' ITT AT" T 

ccxgxgxcxx 


f Dl 


gggcccgtgg 


ggtgtcaagt 


cccxxxxxxc 


tcagagtctc 


ctcccaggct 


aaccaggggt 


$31 1 
oil 


gtagccacgg 


tctgcctgag 


acaggccacg 


A AYPiff A T* A* A A A 

cgggcxgacc 


gxxgxgggcc 


a 4444if <f t 

axxxxggxcg 


$27" 1 


tggctgggcg 


T" tW I A AT A ift 

XgXCCXCgXg 


xcaxcxgxgg 


acacccccax 


crcrtt \ /■» 4" 4" a /> 

gsgxcxxacg 


ggcacagccx 


%W± 


ccc xacgggg 


cm A>T T T IfAT ■ A 

aCXXXgCXXC 


/IT rt O «VCP *1 f+ 4" 

CXooggCCCX 


gxgcccagag 


CaagagCCag 


a a if 4 <r <r 4 r»r»4" 

aagxggxccx 


QQ1 


gaggctgggg 


ctgtgttccc 


tgagccacgc 


ggtcaggggc 


cctgggaccg 


tcctgcatgg 


1051 


gcccgagcct 


gcttgggggg 


gcgtccagga 


ggcaccatcc 


cccgcccatg 


ggcagggtgg 


1111 


gggacgtgag 


ccccgcagga 


acgctgcccc 


aagagtcagc 


cctgtcctcc 


ccttccccgt 


1171 


aggctccttc 


ctcctgggac 


gctggggccc 


ctgggccttt 


tcagaggggt 


ggctgagggc 


1231 


agggtgggcc 


ctggtcccga 


ggaggggcaa 


ggtgggtgca 


gagggtccct 


ccagagcccc 


1291 


ttttctggcc 


cccgtgctcc 


ctgggcctgt 


gagtggggcc 


gccccctgag 


ctgtgtgtcc 


1351 


ttagcgcccc 


ac g tat atg gtg atg gag tac tgc 


gtg tgt ggc atg cag 


1400 



Tyr Met Val Met Glu Tjrr Cys Val Cys Gly Met Gin 
40 130 ~ " 135 

gaa atg ctg gac age gtg ccg gag aag cgt ttc cca gtg tgc cag gec 1448 
Glu Met Leu Asp Ser Val Pro Glu Lys Arg Phe Pro Val Cys Gin Ala 
45 140 145 150 

cac gg gtgcgtgcgcg gggcaggggc cagggtgggg egggggcegg gggecaggea 1504 
His Gly 

so 155 

gggcaggctc ctttccgtga ggccacactg cttgtcctga tattcattga catgaaggee 1564 
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caagtttttt tgtttttttg tttttttgtg ttttttttcg agatggafftc tcactctgtc 


1624 


5 


gcccaggctg gagtgcaatg gtgcgatctc ggctcactgc aagctccgcc tccgaggttc 


1684 




acgccattct tctgcctcag cttcccgagt agctgggatt acaggcgccc gccaccacgc 


1744 


10 


ccggctaatt ttttgtattt ttagtagaga cggggtttca ccgtgttagc caggatggtc 


1804 




tcaaactcct gacctcgtga tccgcctgcc tcagcctccc aaagtgctgg gattacaggc 


1864 


15 


atgagctacc acgcccggcc ttgtaaaggc ccaagttttt aaaaacagtt ttggggtccc 


1924 




ccatgtgtgg catccacagg cagggctgct gccaacctcc cgcctccatc tttgctgggc 


1984 


20 


ctgctgcctg aggccagtgg cctgcttcca gcccatcgct ggcagccgcc tgccctgacc 


2044 




agatctcctg gatgcaggtc tgtggcctca gagtcagggc cccttgctgc tgcaggacca 


2104 


25 


caggggcagg gaggggcctg ctgttccagc aagactttgg ggtgcagccg gcctgtggcc 


2164 




cacaggaaaa tgagacctgt ggacatccgg ggccctgcca gacgtggctc ggccggacga 


2224 


30 


gggtggccac tgcaggcgca ggtgtggctc cctgctggac ctagcctttc ctctgtcctg 


2284 




tgtgcctgga cttctgtgac ttcccagctg ggcctgtggt gtttgggagg ctcccaggca 


2344 


35 


gctgcaaagg ggacccctgt gaggggcagg gaggcctcgg ccccaggacg ggtgtgtgct 


2404 


40 


gcccgcag g tac ttc tgt cag ctg att gac ggc ctg gag tac ctg cat 
Tyr Phe Cys Gin Leu He Asp Gly Leu Glu Tyr Leu His 
160 165 


2452 


45 


o rrr» r>o cr trtTn »++■ or+cr r>qp ucktf <m#» ofp oacr nnet crfftr AflP ftcr P't'tr etc 
age Cag ggC att gig CaC aag gal aw* aag Lug ggg aac ttg ttc 

Ser Gin Gly He Val His Lys Asp He Lys Pro Gly Asn Leu Leu Leu 
170 ITS 180 


2500 


50 


acc acc ggt ggc acc etc aaa ate tec gac ctg ggc gtg gec gag 
Thr Thr Gly Gly Thr Leu Lys lie Ser Asp Leu Gly Val Ala Glu 
185 190 195 


2545 




gtaggcacgt gctagggggg gccctggggc gccccctccc gggcactccc tgagggctgc 


2605 
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10 



20 



25 



30 



35 



45 



50 



55 



acggcaccgc cacag gca ctg cac ccg ttc gcg gcg gac gac acc tgc egg 2656 
Ala Leu His Pro Phe Ala Ala Asp Asp Thr Cys Arg 
200 205 210 

acc age cag ggc tec ccg get ttc cag ccg ccc gag att gec aac ggc 2704 
Thr Ser Gin Gly Ser Pro Ala Phe Gin Pro Pro Glu He Ala Asn Gly 
215 220 225 

ctg gac acc ttc tec ggc ttc aag gtg gac ate tgg teg get ggg gtc 2752 
Leu Asp Thr Phe Ser Gly Phe Lys Yal Asp He Trp Ser Ala Gly Val 
230 235 240 

acc ct gtaagtgcccc gcccccccgg gcactcacca cacgcacact ccgaggggcc 2808 
Thr Leu 



245 










tetgegtett 


gggcagctgc 


cggcctgtgg 


gcgcagggcg tggccaccgg cccagaccct 


2868 


ctctggccac 


ageegctagg 


gggtgcttac 


tttatggaaa tgtaactcat aeggcagatg 


2928 


gtggttcacc 


cgtgtgaagt 


gcagcctggc 


ccgtcaggga tcttcacaga gtggcacggc 


2988 


cgaccctcct 


cccagagccc 


cacagggaag 


ctgggcgggt gacagcagct ccaggcccct 


3048 


tccccgggtg 


ggtccagagg 


acactcccct 


cctaccccgt agcctccact agtggaaggt 


3108 


ggtgaagaca 


gaggtgtcct 


tgagtccaca 


gggcctctgg tccagcagcc acgggacgcc 


3168 


tctgtccctg 


gggtagagct 


ggggctccta 


gggegtcaac caccttgact gaccacgcct 


3228 


ttcttccctc 


ccctcgaaat 


gaag c tac 
Tyr 


aac ate acc acg ggt ctg tac ccc 
Asn He Thr Thr Gly Leu Tyr Pro 


3280 



40 250 " 

ttc gaa ggg gac aac ate tac aag ttg ttt gag aac ate ggg aag ggg 3328 
Phe Glu Gly Asp Asn He Tyr Lys Leu Phe Glu Asn lie Gly Lys Gly 
255 260 265 270 



age tac gee ate ccg ggc gac tgt ggc ccc ccg etc tct gac ctg ctg 3376 
Ser Tyr Ala lie Pro Gly Asp Cys Gly Pro Pro Leu Ser Asp Leu Leu 
275 280 285 

aaa g gtgggagcctca tccctctgcc cgcagcccca gggaggcggg gcttttgtgc 3432 
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Lys 

agaaatgtag ggttgggggt gtcaggtggg gggctattgg ccccgagacc ccagcaggca 3492 

ttgagaggac tgagtggaga ggccgacctc cccgcagggc ctggtttgcc aggtccctca 3552 

gctccaccct gcttctgggc cctgttcacc ctccgaactc ccaccccaga gggcagtgct 3612 

gccctgcgcc tcccccagcc ccaccctcgg gggctccctg gcttgcaggg tctgtcaggg 3672 

ttgtcctgct gcacttccta cgcatggcag caggtggcac tggccgtccg tccatctgcc 3732 

cagtggcctt gggagaacgg aaccgccctg gccgtccagc ccagccctgt ctccctgcca 3792 

gccgcgcaca ggctgtcccc ggcatgtccc aggagtggag tggcctctgt cagggagacc 3852 

gcctgtgcgc ggggtccccc ttaggagcgt ccaggtatca cccagggcct gacaacagag 3912 

gctgggcagg cggggacggt tggtggggtc tcaggcctgt gcccagctga caggctcctc 3972 

gccggcttct cctcag gg atg ctt gag tac gaa ccg gcc aag agg ttc tec 4023 
Gly Het Leu Glu Tyr Glu Pro Ala Lys Arg Phe Ser 
290 295 

ate egg cag ate egg cag cac ag gtgagcggccc ctgggggcag tggggecgag 4077 
He Arg Gin He Arg Gin His Ser 
300 305 

getgeaggga ggccggccat gtgggcagct ggttgagcgg gegctagage agggcgtggt 4137 

gggggtgcca ggctgggctg gggccagacc ccgtgcagcg cccgcagttc tcggggcccg 4197 

agtggggtct ctgggcagtg tcctgttacc ggccagaccc aggegecttg tccgaaetgg 4257 

ggtctgagtg aggacatgeg tccgtccctg ccctaggcat ggagatgege caggaagggc 4317 

acagctggtc ccaaacactg gegagagect ctctttttcc cctcctcctg gggctcccag 4377 

cagcagggtg tggctgggat ccagcccagg gcccccagct ccatgacagg gaagacagag 4437 

cageggaegg ggtcagcagg ccccacagtg ccgcctccct cacttcgtgg gctctgctcc 4497 
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tctgcaccag cccctggagg cccttgagcc gtctgctgga gcccctccga gccccgaggc 4557 

cacccactga gaccggctct gggagtggga gtgtccggac ccctgaggcg ctggtgctga 4617 

ttgtgccttg ggggtctctg cacagctcgg gtcatctggg cgcctggcgg ggactggggc 4677 

tgccccccga tagcctcctg ggctgggatg tgctcagggc cccccagacc cccttctggc 4737 

ctttgctggc tttgcagcca gcatccatct ggtgggtgct ggcttctgag tgccacctgg 4797 

gacacaggcc tcagggtgga ggggacatct gtcaggcttg gagtcaggtc agcctgcctg 4857 

ctcctagagg acatggctga gcttctgtgg tcacagccac cccttgcacg gcctggtccc 4917 

agctcctgag tgtgtggcag gtaccctggg cccagaggag ctgggtcgga aaactggacc 4977 

gccctggtgc cagcctgaca ggcgccactg cttctgggcg tttgcag c tgg ttc egg 5034 

Trp Phe Arg 
310 

aag aaa cat cct ccg get gaa gca cca gtg ccc ate cca ccg age cca 5082 
Lys Lys His Pro Pro Ala Glu Ala Pro Vai Pro lie Pro Pro Ser Pro 
315 320 325 

gac ace aag gac egg tgg cgc age atg act gtg gtg ccg tac ttg gag 5130 
Asp Thr Lys Asp Arg Trp Arg Ser Met Thr Val Val Pro Tyr Leu Glu 
330 335 " 340 

gac ctg cac ggc gcg gac gag gac gag gac etc ttc gac ate gag gat 5178 
Asp Leu His Gly Ala Asp Glu Asp Glu Asp Leu Phe Asp He Glu Asp 
345 350 355 

gac ate ate tac act cag gac ttc acg gtg ccc g gtgagtctggcg 5224 
Asp He He Tyr Thr Glu Asp Phe Thr Val Pro 
360 365 

45 ggggcccctg cccggctctg ctgactcggc caggatgtcc caegggagea gggtgcctgc 5284 

ctgtctgcaa caaggacagc ttctgccctc tggtggccaa tcccacgtcc ccaaagcctc 5344 

so cagcccacct gcaggctgcc tccgccctgc gggccgctgg gacatggctg aaaggtgtgg 5404 

ggtcagcggg ggcaccagcc caggcctgtc tggecaggag ggttcctcag gcgtctctcc 5464 

55 
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gggtgctgcc cagccaggca ccacccaccg gccttggcct gagtcccagc aggagcaggc 5524 

gggggagccc cagggtcggg ggagggtagg tgagagtcag ggtgcagggt ggcccctcag 5584 

acagctggca tgagagaggg tccagtggcc ctccctcccg tcgtccctga ggcctgcccg 5644 

ctggccctga tgccggc 
5661 



15 <210> 4 

<211> 2095 
<212> DNA 

<213> Hobo sapiens 

20 

<220> 

<221> intron 
<222> (1)..(454) 

25 

<220> 

<221> exon 

<222> (455).. (2095) 

30 

<400> 4 

tttcgcgtgc ctggcctgag cctggcccga gcctggccct cctgtgtcct cacagatgag 60 
35 catgtggcgg ctcctgggcc tctagaacca accatgggcc agggtgcccc aggggagcac 120 

gggagggtcc tgccttgtca gcttgcctcc tactcgtgag gttcctgcag tcagtacctg 180 
40 ggtggggtcc cacctgcggc catggcaggt gcaacagacg tggtggaggg gacactcctg 240 

cccaggccat ctgcgggagg ctcagccccg gggggtgcct cccagagctg ctggggggca 300 
45 gcatttcagg ctggatacac ctgggcctga cccgggggcg ggcatggcct gggcagcagc 360 

tgtaagtgcg tccccgtggt gggggccagc caggtccctg tggctctggg gttgcgcccc 420 

so tcagctcagg ccacacttgc cgtctccctc ccag ga cag gtc cca gaa gag gag 474 

Gly Gin Val Pro Glu Glu Glu 
370 375 
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5 


gee agt cac aat gga cag cgc egg ggc etc ccc aag gec gtg tgt atg 
Ala Ser His Asn Gly Gin Arg Arg Gly Leu Pro Lys Ala Val Cys Met 
^80 385 390 

JUv XJsJtJ vw 


522 


10 


aac ggc aca gag gcg gcg cag ctg age acc aaa tec agg gcg gag ggc 
Asn Gly Thr Glu Ala Ala Gin Leu Ser Thr Lys Ser Arg Ala Glu Gly 
395 400 405 


570 


15 


r\*rtT rr/> r» r»nr* gap r*P+ if Of* Pttt* aoff (7PP toY* + PP <TPf* afTP flC'P OQff fti"P 
Cgg gCC CCC oal UvrU glslr l«gl* i"*5 gv*^ vgC ll«L gv-U agi* afit aOfe Out 

Arg Ala Pro Asn Pro Ala Arg Lys Ala Cys Ser Ala Ser Ser Lys He 

Ain Alt* A9(i 
*tlU rrltf t&u 


618 


20 


cgc Cgg ctg teg gcc T/gc aag cag cag igaggiiggi tgtLigtd^g 
Arg Arg Leu Ser Ala Cys Lys Gin Gin 


665 




^gggSCgcgg cggggcccgg gxggggcatg Lggggacaac gccLggaugo witagti/dgo 


725 




Cgxgagc a ua gcccgcgcua gxcogucatg girgaccgi*ca iguggiigig cguggi*LgLu 


785 


25 


atgtggccT.t xgggxggciu ggecaegtag cgaLimgL gsagggigi-i b^^&sss^ 


845 




r»+ tro+cr+ r»+ tr or#» p o <tp <r+ <rp +pv+na"f"PVja f?fTPP"fcjlPirf"l? i"I?frPf?(?l?l?pt pt.finri?firf?l??P, 
CLggLgLCl/g gCCagCglgL wggtuaugga ggtutatg^s i* ul 666oB66^ 


905 


30 


ir+<rrpp+p/»+ /»ftr»<»«'PCJirc t"P"hP»irajrf't? ffi?+f?PJl'M"f*P ^M?fiTaCCC"t? CCCt^ETfiTCCt 


965 




ppT^prprpt rrppftl'^prp irpi?cc£cttc cajr^amLKc ttatifCTjnnr ctcai? ccca£ 


1025 


35 


a^ertttt^a ffcaccartffj? fftefrtearte ztzzzzazzz £cc£C££ect ccatffgctct 


1085 




pppff^M+^p ppraff'flrrtr'i' ir»fi?pp.airctir ccaajrta."t££ c tfTRPfTP tjra utCjrtfTCCfrJI 


1145 


40 


ap^pf^pppt pfpfptpppf rtrt^cctffc ctcctctccc acccccaffcc ccaeccccffE 


1205 




gtgggagacg gagtcccaga ggtgtcagag acccttaagt cacctgccga ggatgcgggg 


1265 


45 


tggatggggg cccgaggctg aagcccctgc cttgccacag cccctctccc aggttttggg 


1325 




ggccaccgcc tgagttacat gtctgtcccc caaatgggtg cccacagccc atccaccagc 


1385 


50 


gtcagagccc gccaggcccc actgeaaaag gccacacaat gtaccccggg agtgactcaa 


1445 




gggtggcctt ccctggcctc ccctgctgcc ccccaggagt ccggtagccc catgactgta 


1505 
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pp+pflfpf f p tccxtcotcc 






PP PJlJfP'P'hfrA 


1565 


5 


pp4/P"f"f"PPtrf~ /* "t"f* r» /»+ "I* pp ft 


P P P 1* ^Cfl^P P 


pjrhfrt PPftlTl? ftl?PPPPl?PP» 




1S25 




ccaggcccxc agxcLtccLg 


r» r» crcr4- 4" /» <y r* 
vCgg U LCLgU 


ccgccctccc ggagaggtgg 




1DOU 


10 


xctgrgccga ccacgcccca 


ggacc LCCgg 


agCgCCCLgC aggg C CgggC 


agggggacag 


1 r^o 




cagggaccgg gcgcagccc x 


CCCCCCXCgg 


CCgCCCggCa gXgCaCgCgg 




loUD 


15 


tcgcagcccc gggcggagcc 


ttcccgggcg 


ggcgtgggag gagggaggcg 


gcctccatgc 


1865 




actttatgtg gagactactg 


gccccgcccg 


tggcctcgtg ctccgcaggg 


cgcccagcgc 


1925 


20 


cgtccggcgg ccccgccgca 


gaccagctgg 


cgggtgtgga gaccaggctc 


ctgaccccgc 


1985 




catgcatgca gcgccacctg 


gaagccgcgc 


er rr />/»*r , /*4" 4~4" w tT4*4~4" 4~ 4^4" art 4* 
ggCCgCXXXg gXXXXXXgTL 


LggXLggXXC 




25 


cattttcttt ttttcttttt 


ttttttaaga 


aaaaataaaa ggtggatttg 




2095 


30 


<210> 5 
<211> 1302 
<212> DMA 
<213> Homo sapiens 










35 


<220> 
<221> CDS 











<222> (1)..(1302) 
<400> 5 

atg gag gtg gtg gac ccg cag cag ctg ggc atg ttc acg gag ggc gag 48 
Met Glu Val Val Asp Pro Gin Gin Leu Gly Met Phe Thr Glu Gly Glu 
15 10 15 

ctg atg teg gtg ggt atg gac acg ttc ate cac cgc ate gac tec ace 96 
Leu Met Ser Val Gly Met Asp Thr Phe He His Arg He Asp Ser Thr 
20 25 30 

gag gtc ate tac cag ccg cgc cgc aag egg gec aag etc ate ggc aag 144 
Glu Val He Tyr Gin Pro Arg Arg Lys Arg Ala Lys Leu He Gly Lys 
35 40 45 
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10 



tac ctg atg ggg gac ctg ctg ggg gaa ggc tct tac ggc aag gtg aag 192 
Tyr Leu Met Gly Asp Leu Leu Gly Glu Gly Ser Tyr Gly Lys Val Lys 
50 55 60 

gag gtg ctg gac teg gag acg ctg tgc agg agg gec gtc aag ate etc 240 
Glu Val Leu Asp Ser Glu Thr Leu Cys Arg Arg Ala Val Lys He Leu 
65 70 75 80 



15 



20 



25 



30 



35 



40 



45 



50 



aag aag aag aag ttg cga agg ate ccc aac ggg gag gec aac gtg aag 288 
Lys Lys Lys Lys Leu Arg Arg lie Pro Asn Gly Glu Ala Asn Val Lys 
85 90 95 

aag gaa att caa eta ctg agg agg tta egg cac aaa aat gtc ate cag 336 
Lys Glu He Gin Leu Leu Arg Arg Leu Arg His Lys Asn Val He Gin 
100 105 110 

ctg gtg gat gtg tta tac aac gaa gag aag cag aaa atg tat atg gtg 384 
Leu Val Asp Val Leu Tyr Asn Glu Glu Lys Gin Lys Met Tyr Met Val 
115 120 125 

atg gag tac tgc gtg tgt ggc atg cag gaa atg ctg gac age gtg ccg 432 
Met Glu Tyr Cys Val Cys Gly Met Gin Glu Met Leu Asp Ser Val Pro 
130 135 140 

gag aag cgt ttc cca gtg tgc cag gec cac ggg tac ttc tgt cag ctg 480 
Glu Lys Arg Phe Pro Val Cys Gin Ala His Gly Tyr Phe Cys Gin Leu 
145 150 155 160 

att gac ggc ctg gag tac ctg cat age cag ggc att gtg cac aag gac 528 
He Asp Gly Leu Glu Tyr Leu His Ser Gin Gly lie Val His Lys Asp 
165 170 _ 175 

ate aag ccg ggg aac ctg ctg etc ace acc ggt ggc acc etc aaa ate 576 
lie Lys Pro Gly Asn Leu Leu Leu Thr Thr Gly Gly Thr Leu Lys He 
180 185 190 

tec gac ctg ggc gtg gee gag gca ctg cac ccg ttc gcg gcg gac gac 624 
Ser Asp Leu Gly Val Ala Glu Ala Leu His Pro Phe Ala Ala Asp Asp 
195 200 205 

acc tgc egg acc age cag ggc tec ccg get ttc cag ccg ccc gag att 672 
Thr Cys Arg Thr Ser Gin Gly Ser Pro Ala Phe Gin Pro Pro Glu He 
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210 



215 



220 



gcc aac ggc ctg gac acc ttc tec ggc ttc aag gtg gac ate tgg teg 
Ala Asn Gly Leu Asp Thr Phe Ser Gly Phe Lys Val Asp He Trp Ser 
225 230 235 240 



720 



10 



get ggg gtc acc etc tac aac ate acc acg ggt ctg tac ccc ttc gaa 
Ala Gly Val Thr Leu Tyr Asn He Thr Thr Gly Leu Tyr Pro Phe Glu 
245 250 255 



768 



15 



ggg gac aac ate tac aag ttg ttt gag aac ate ggg aag ggg age tac 
Gly Asp Asn lie Tyr Lys Leu Phe Glu Asn He Gly Lys Gly Ser Tyr 
260 265 270 



816 



20 



gcc ate ccg ggc gac tgt ggc ccc ccg etc tct gac ctg ctg aaa ggg 
Ala He Pro Gly Asp Cys Gly Pro Pro Leu Ser Asp Leu Leu Lys Gly 
275 280 285 



864 



25 



30 



atg ctt gag tac gaa ccg gcc aag agg ttc tec ate egg cag ate egg 912 
Met Leu Glu Tyr Glu Pro Ala Lys Arg Phe Ser He Arg Gin lie Arg 
290 295 300 

cag cac age tgg ttc egg aag aaa cat cct ccg get gaa gca cca gtg 960 
Gin His Ser Trp Phe Arg Lys Lys His Pro Pro Ala Glu Ala Pro Val 
305 310 315 320 



35 



ccc ate cca ccg age cca gac acc aag gac egg tgg cgc age atg act 
Pro He Pro Pro Ser Pro Asp Thr Lys Asp Arg Trp Arg Ser Met Thr 
325 330 335 



1008 



45 



gtg gtg ccg tac ttg gag gac ctg cac ggc gcg gac gag gac gag gac 1056 
Val Val Pro Tyr Leu Glu Asp Leu His Gly Ala Asp Glu Asp Glu Asp 
340 345 350 

etc ttc gac ate gag gat gac ate ate tac act cag gac ttc acg gtg 1104 
Leu Phe Asp lie Glu Asp Asp He He Tyr Thr Gin Asp Phe Thr Val 
355 360 365 



ccc gga cag gtc cca gaa gag gag gcc agt cac aat gga cag cgc egg 
Pro Gly Gin Val Pro Glu Glu Glu Ala Ser His Asn Gly Gin Arg Arg 
370 375 380 



1152 



ggc etc ccc aag gcc gtg tgt atg aac ggc aca gag gcg gcg cag ctg 



1200 



55 
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10 



15 



Gly leu Pro Lys Ala Val Cys Met Asn Gly Thr Glu Ala Ala Gin Leu 
385 390 395 400 

age acc aaa tec agg gcg gag ggc egg gec ccc aac cct gee cgc aag 1248 
Ser Thr Lys Ser Arg Ala Glu Gly Arg Ala Pro Asn Pro Ala Arg Lys 
405 410 415 

gec tgc tec gee age age aag ate cgc egg ctg teg gee tgc aag cag 1296 
Ala Cys Ser Ala Ser Ser Lys lie Arg Arg Leu Ser Ala Cys Lys Gin 
420 425 430 

cag tga 
1302 
Gin 

20 <210> 6 

<211> 433 
<212> PRT 
<213> Homo sapiens 

<400> 6 

Met Glu Val Val Asp Pro Gin Gin Leu Gly Met Phe Thr Glu Gly Glu 
15 10 15 

Leu Met Ser Val Gly Met Asp Thr Phe He His Arg He Asp Ser Thr 
20 25 30 

Glu Val lie Tyr Gin Pro Arg Arg Lys Arg Ala Lys Leu He Gly Lys 
35 40 45 

Tyr Leu Met Gly Asp Leu Leu Gly Glu Gly Ser Tyr Gly Lys Val Lys 
50 55 60 

Glu Val Leu Asp Ser Glu Thr Leu Cys Arg Arg Ala Val Lys He Leu 
65 70 75 80 

45 Lys Lys Lys Lys Leu Arg Arg He Pro Asn Gly Glu Ala Asn Val Lys 

85 90 95 



25 



30 



35 



40 



50 



55 



Lys Glu lie Gin Leu Leu Arg Arg Leu Arg His Lys Asn Val He Gin 
100 105 110 

Leu Val Asp Val Leu Tyr Asn Glu Glu Lys Gin Lys Met Tyr Met Val 
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115 



120 



125 



Met Glu Tyr Cys Val Cys Gly Met Gin Glu Met Leu Asp Ser Val Pro 
130 135 140 

Glu Lys Arg Pfae Pro Val Cys Gin Ala Bis Gly Tyr Phe Cys Gin Leu 
145 150 155 160 

He Asp Gly Leu Glu Tyr Leu His Ser Gin Gly lie Val His Lys Asp 
165 170 175 

He Lys Pro Gly Asn Leu Leu Leu Thr Thr Gly Gly Thr Leu Lys He 
180 185 190 

Ser Asp Leu Gly Val Ala Glu Ala Leu His Pro Phe Ala Ala Asp Asp 
195 200 205 

Thr Cys Arg Thr Ser Gin Gly Ser Pro Ala Phe Gin Pro Pro Glu He 
210 215 220 

Ala Asn Gly Leu Asp Thr Phe Ser Gly Phe Lys Val Asp He Trp Ser 
225 230 235 240 

Ala Gly Val Thr Leu Tyr Asn He Thr Thr Gly Leu Tyr Pro Phe Glu 
245 250 255 

Gly Asp Asn He Tyr Lys Leu Phe Glu Asn He Gly Lys Gly Ser Tyr 
260 265 270 

Ala lie Pro Gly Asp Cys Gly Pro Pro Leu Ser Asp Leu Leu Lys Gly 
275 280 285 

Met Leu Glu Tyr Glu Pro Ala Lys Arg Phe Ser He Arg Gin He Arg 
290 295 300 

Gin His Ser Trp Phe Arg Lys Lys His Pro Pro Ala Glu Ala Pro Val 
305 310 315 320 

Pro He Pro Pro Ser Pro Asp Thr Lys Asp Arg Trp Arg Ser Met Thr 
325 330 335 



Val Val Pro Tyr Leu Glu Asp Leu His Gly Ala Asp Glu Asp Glu Asp 
340 345 350 
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Leu Phe Asp He Glu Asp Asp He lie Tyr Thr Gin Asp Phe Thr Val 
355 360 365 

Pro Gly Gin Val Pro Glu Glu Glu Ala Ser His Asn Gly Gin Arg Arg 
370 375 380 

Gly Leu Pro Lys Ala Val Cys Met Asn Gly Thr Glu Ala Ala Gin Leu 
385 390 395 400 

Ser Thr Lys Ser Arg Ala Glu Gly Arg Ala Pro Asn Pro Ala Arg Lys 
405 410 415 

Ala Cys Ser Ala Ser Ser Lys He Arg Arg Leu Ser Ala Cys Lys Gin 
420 425 430 

Gin 



<210> 7 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ698*, an artificially 
synthesized priner sequence 

<400> 7 

ggtccccgag gacgaagttg a 21 

<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ673", an artificially 
synthesized primer sequence 

<400> 8 

accatcagca ccgtgactgg 20 
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<210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 9 

tcgccggccg atgacaga 

<210> 10 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized priner sequence 

<400> 10 

aaggagacgg gaagaggagc ag 

<210> 11 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 11 

gaggaggggc aaggtgggt 

<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 



Sequence: "DJ703", an artificially 

18 

Sequence: "DJ674", an artificially 

22 

Sequence: "DJ690", an artificially 

19 

Sequence: "DJ680", an artificially 
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synthesized primer sequence 
<400> 12 

gtgtggcctc acggaaagga g 21 

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ692", an artificially 
synthesized primer sequence 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ694", an artificially 
synthesized primer sequence 



<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ695", an artificially 
synthesized primer sequence 



<400> 13 

agctgggcct gtggtgtttg 



20 



<400> 14 

cagaggcccc tcggagtgtg 



20 



<400> 15 

gcctctgtcc ctggggtaga 



20 



<210> 16 
<211> 22 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 16 

tcagtcctct caatgcctgc tg 

<210> 17 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 17 

gcggggtccc ccttaggag 

<210> 18 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 18 

ctagcgcccg ctcaaccag 

<210> 19 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 



Sequence: "DJ693", an artificially 



22 



Sequence: *DJ696*\ an artificially 



19 



Sequence: a, DJ697" 3 an artificially 



19 



Sequence: "DJ675", an artificially 
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<400> 19 

ggagctgggt cggaaaactg ga 



22 



<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ702", an artificially 
synthesized priaer sequence 



<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ676", an artificially 
synthesized priaer sequence 



<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ67T, an artificially 
synthesized priaer sequence 



<400> 20 

tgctcccgtg ggacatcctg 



20 



<400> 21 

gtaagtgcgt ccccgtggtg 



20 



<400> 22 

gtggcatcca ggcgttgtcc 



20 



<210> 23 
<211> 22 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 23 

gggaattcgg aacacaagga ag 

<210> 24 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 24 

atggaggtgg tggacccgc 

<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 25 

gttacggcac aaaaatgtca tcca 

<210> 26 
<2U> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 26 

ggtgatggag tactgcgtgt g 



Sequence: "DJ705", an artificially 

22 

Sequence: "DJ649*, an artificially 

19 

Sequence: "DJ659", an artificially 

24 

Sequence: "DJ666", an artificially 
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<210> 27 
<211> 21 
<212> DM 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 27 

acatcgggaa ggggagctac g 

<210> 28 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 28 

ccgggcaccg tgaagtcctg 

<210> 29 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthesized primer sequence 

<400> 29 

tcactgctgc ttgcaggcc 

<210> 30 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 



Sequence: "DJ684", an artificially 

21 

Sequence: "DJ660", an artificially 

20 

Sequence: "DJ650", an artificially 
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<223> Description of Artificial Sequence: "DJ717", an artificially 
synthesized priaer sequence 

<400> 30 

gcaggcggcc agcctca 17 

<210> 31 

<211> 19 

<212> PRT 

<213> Hobo sapiens 

<400> 31 

Cys Leu Ser Thr Lys Ser Arg Ala Glu Gly Arg Ala Pro Asn Pro Ala 
1 5 10 IS 

Arg Lys Ala 



<210> 32 

<211> 20 

<212> PRT 

<213> Hobo sapiens 

<400> 32 

Cys He Arg Gin He Arg Gin His Ser Trp Phe Arg Lys Lys His Pro 
15 10 15 

Pro Ala Glu Ala 
20 

<210> 33 
<211> 20 
<212> PRT 

<213> Hobo sapiens ■ 
<400> 33 

Cys Phe Glu Gly Asp Asn He Tyr Lys Leu Phe Glu Asn He Gly Lys 
15 10 15 

Gly Ser Tyr Ala 
20 

<210> 34 
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<211> 20 
<212> PET 

<213> Homo sapiens 
<400> 34 

Cys His Arg He Asp Ser Thr Glu Val lie Tyr Gin Pro Arg Arg Lys 
15 10 15 

Arg Ala Lys Leu 
20 

<210> 35 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ180", an artificially 
synthesized primer sequence 

<400> 35 

cgactcacta tagggagacc ca 22 

<210> 36 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "DJ181", an artificially 
synthesized primer sequence 

<4O0> 36 

cctcgagaat taccctcact aa 22 

<210> 37 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially synthesized 
primer sequence 
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<400> 37 

tggatctaca ctccggc 17 

<210> 38 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: an artificially synthesized 
primer sequence 

<400> 38 

attttactgg ctggcacttg 20 



<210> 39 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: "IK El", an artificially 
synthesized priner sequence 

<400> 39 

gatgaattcg ggtccagcat ggaggtggtg gac 33 

<210> 40 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "LK E2", an artificially 
synthesized primer sequence 

<400> 40 

gatgaattct tagaggtctt cttctgagat gagcttctgc tcctgctgct tgcaggccga 60 



<210> 41 
<211> 10 
<212> PET 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially synthesized 
c-Myc partial sequence 

<400> 41 

Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
1 5 10 

<210> 42 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "LK E4", an artificially 
synthesized prime r sequence 

<400> 42 

gatgggccct tacagggagg catagtcagg cacatcatat gggtactgct gcttgcaggc 60 
cga 63 

<210> 43 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "LK E5", an artificially 
synthesized priner sequence 

<400> 43 

gatgaattct tagtgatggt gatggtgatg ctgctgcttg caggccga 48 

<210> 44 
<211> 11 
<212> PET 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially synthesized HA 
partial sequence 
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<400> 44 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu 
1 5 10 



<210> 45 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: *LK D176N", an artificially 
synthesized primer sequence 



<210> 46 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "LK W308C", an artificially 
synthesized priaer sequence 



<210> 47 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: "II L67P", an artificially 
synthesized primer sequence 



<400> 45 

attgtgcaca agaacatcaa gccgggg 



27 



<400> 46 

cggcagcaca gctgcttccg gaagaaa 



27 



<400> 47 

gtgaaggagg tgccggactc ggagacg 



27 



<210> 48 
<211> 27 
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<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> Description of Artificial Sequence: "LK Ell", an artificially 
synthesized primer sequence 

10 

<400> 48 

aggagggccg tcatcatcct caagaag 27 

15 



Claims 

20 

I . A primer DNA used for diagnosing a disease caused by mutation in the LKB1 gene, the primer DNA comprising a 
nucleotide sequence containing at least a portion of any one of the nucleotide sequences shown in SEQ ID NOs: 
1 to 4. 

25 2. The primer DNA according to claim 1 , wherein the primer DNA has a nucleotide sequence corresponding to any 
one of the nucleotide sequences shown in SEQ ID NOs: 7 to 30. 

3. The primer DNA according to claim 1 or 2, wherein the disease caused by mutation in the LKB1 gene is Peutz- 
Jeghers syndrome. 

30 

4. A probe DNA used for diagnosing a disease caused by mutation in the LKB1 gene, the probe DNA comprising a 
nucleotide sequence containing at least a portion of any one of the nucleotide sequences shown in SEQ ID NOs: 
1 to 4. 

35 5. The probe DNA according to claim 4, wherein the disease caused by mutation in the LKB1 gene is Peutz-Jeghers 
syndrome. 

6. A therapeutic preparation for a disease caused by mutation in the LKB1 gene, the preparation comprising the LKB1 
gene as an active ingredient. 

40 

7. A therapeutic preparation for a disease caused by mutation in the LKB1 gene, the preparation comprising the LKB1 
protein an active ingredient. 

8. A therapeutic preparation for a disease caused by mutation in the LKB1 gene, the preparation comprising a com- 
45 pound that enhances the activity of LKB1 protein as an active ingredient. 

9. The therapeutic preparation according to claims 6 to 8, wherein the disease caused by mutation in the LKB1 gene 
is Peutz-Jeghers syndrome. 

so 10. A reagent for diagnosing a disease caused by mutation in the LKB1 gene, the reagent comprising an antibody that 
binds to the LKB1 protein as an active ingredient. 

II. The reagent according to claim 10, wherein the disease caused by mutation in the LKB1 gene is Peutz-Jeghers 
syndrome. 

55 

12. A method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising detecting muta- 
tion in the LKB1 gene. 
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13. A method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

(a) preparing a DNA sample from a patient; 

(b) amplifying the DNA using the primer DNA according to (1) ; 
5 (c) cleaving the amplified DNA; 

(d) fractionating the DNA fragments according to their size; 

(e) hybridizing the probe DNA according to (4) with the fractionated DNA fragments; and 

(f) comparing the size of the DNA fragment thus detected to that from a control of a healthy subject. 

10 14. A method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

(a) preparing a RNA sample from a patient; 

(b) fractionating the RNA sample depending on its size; 

(c) hybridizing the probe DNA according to (4) with the RNA thus fractionated; 

is (d) comparing the size of the RNA thus detected to that from a control of a healthy subject. 

1 5. A method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

(a) preparing a DNA sample from a patient; 

(b) amplifying the DNA using the primer DNA according to (1) ; 

(c) separating the amplified DNA into single stranded DNA; 

(d) fractionating the separated single stranded DNA on the non-denatured gel; 

(e) comparing the mobility of the single stranded DNA separated on the non-denatured gel to that of a control 
of a healthy subject. 

16. A method of diagnosing a disease caused by mutation in the LKB1 gene, the method comprising the steps of: 

(a) preparing a DNA sample from a patient; 

(b) amplifying the DNA using the primer DNA according to (1); 

30 (c) fractionating the amplified DNA on the DNA denatured gradient gel; 

(d) comparing the mobility of the fractionated DNA on the gel to that of a control of a healthy subject. 

17. The method according to any one of claims 12 to 16, wherein the disease caused by mutation in the LKB1 gene is 
Peutz-Jeghers syndrome. 

35 
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Figure 1 
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Figure 2 
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Figure 3 
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Figure 4 
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Figure 5 
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Figure 6 
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Figure 7 
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Figure 8 



Anti-Myc Antibody Anti-LKBl Antibody 



M123 M123 
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Figure 9 sji 
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Figure 10 
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Figure 11 
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Figure 12 
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